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INTRODUCTION 



Technical Field 

The present invention relates to methods and compounds for modulating nuclear receptor 
coactivator binding. 

Background 

25 Cells contain receptors that can elicit a biological response by binding various molecules 

including proteins, hormones and/or drugs. Nuclear receptors represent a super family of proteins 
that are hormone/ligand-activated transcription factors that enhance or repress transcription in a cell 
type-, ligand- and promoter-dependent manner. The nuclear receptor family includes receptors for 
glucocorticoids (GRs), androgens (ARs), mineralocorticoids (MRs), progestins (PRs), estrogens 
(ERs), thyroid hormones (TRs), vitamin D (VDRs), retinoids (RARs and RXRs), peroxisomes 
(XPARs and PPARs) and icosanoids (IRs). The so-called "orphan receptors" for which ligands 
have not been identified are also part of the nuclear receptor superfamily, as they are structurally 
homologous to the classic nuclear receptors, such as steroid and thyroid receptors. 
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5 Although overall sequence conservation between nuclear receptors varies between different families 
of receptors, sequence conservation between functional regions, or modules, of the receptors is 
high. For example, nuclear receptors can be organized into functional modules comprising an N- 
terminal transcriptional activation domain, a central DNA binding domain (DBD), and a C-terminal 
ligand binding domain (LBD). The LBD of nuclear receptors represents a hormone/ligand- 
1 0 dependent molecular switch. Binding of hormone to a nuclear receptor's LBD changes its ability to 
modulate transcription of DNA, although they may have transcription-independent actions. Nuclear 
receptors also bind proteins, such as chaperone complexes, corepressors, or coactivators, that are 
involved in receptor function. Hormone binding by a nuclear receptor can increase or decrease 
binding affinity to these proteins, and can influence or mediate the multiple actions of the nuclear 
receptors on transcription. For example, nuclear receptors can stimulate transcription in response to 
hormone binding by recruiting coactivator proteins to promoters of responsive genes (Glass et al., 
Curr. Opin. Cell Biol. (1997) 9:222-32); and Horwitz et al., Mol. Endocrinol. (1996) 70:1 167-77). 

Coactivators of the pi 60 family mediate activity of a transcriptional activation domain, 
called AF2, that is part of the nuclear receptor's LBD. A few receptor mutants deficient in 
coactivator-dependent activation have been isolated (TR: Collingwood et al. Proc. Natl. Acad. Sci. 
(1997) 94:248-253; VDR: Jurutka et al., J. Biol. Chem. (1997) 227:14592-14599, Masayama et al., 
Mol. Endocrinol. (1997) 77:1507-1517; ER and RAR: Henttu et al., Mol. Cell Biol. (1997) 77:1832- 
1839). While these studies support the physiological relevance of the observed interaction, the 
structural and functional nature of the site to which coactivators bind has not been defined. 
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The medical importance of nuclear receptors is significant. They have been implicated in 
breast cancer, prostate cancer, cardiac arrhythmia, infertility, osteoporosis, hyperthyroidism, 
hypercholesterolemia, obesity and other conditions. However, limited treatments are available and 
current agonist/antagonist drugs used to target nuclear receptors are ligands that bind to the 
receptor's LBD buried deep within the receptor. Although additional targets on nuclear receptors 
are desired for drug development, the structural and functional basis of such sites, including the 
coactivator binding site, has not been described. 

Accordingly, a need exists for identification and characterization of the coactivator binding 
sites of nuclear receptors, and molecules that affect their interaction with cellular coactivator 
proteins. This would provide a major new target for iterative drug design, synthesis, and selection. 
It also would be advantageous to devise methods and compositions for reducing the time required to 
discover compounds that target the coactivator binding site of nuclear receptors and administer 
them to organisms to modulate physiological processes regulated by nuclear receptors. 
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5 Relevant Literature 

Wagner et al., {Nature (1995) 375:690-697) disclose the crystal structure of rat TR-alpha 
LBD. Various references disclose mutations in carboxyl-terminal helices of nuclear receptors 
(Henttu et al., supra : O'Donnell et al., Mol Endocrinol. (1991) 5:94-99; Whitfield et al., Mol 
Endocrinol (1995) 9:1 166-79; Saatcioglu et al., Mol Cell Biol (1997) 77:4687-95; Collingwood et 
10 al., supra : Kamei et al., Cell (1996) 55:403-14). Hong et al. (Proc. Natl Acad ScL USA (1996) 
95( 10):498-49452) and Hong et al. (Mol Cell Biol. (1997) 77:2735-2744) disclose cloning and 
expression of GRIP1 coactivator. Torchia et al., (Nature (1997) 557:677-84), Le Douarin et al., 
(EMBO J (1996) 75:6701-6715) and Heery et al. (Nature (1997) 557:733-736) disclose sequence 
alignment of various coactivator proteins showing a (SEQ ID NO: 1) LxxLL motif. 

15 

SUMMARY OF THE INVENTION 

The present invention relates to identification and manipulation of the coactivator binding 
site of nuclear receptors. Identification of this site permits design and obtention of compounds that 
bind to the coactivator binding site of nuclear receptors and modulate coactivator binding to the 
20 receptor. The compounds include agonists and antagonists that modulate nuclear receptor activity 
by promoting (agonists) or blocking (antagonists) hormone-dependent coactivator binding to the 
receptor, particularly antagonists. The compounds of the invention can be receptor-, cell- and/or 
tissue-specific. 

The present invention also includes protein cocrystals of nuclear receptors with a molecule 
25 bound to the coactivator binding site and methods for making them. The cocrystals provide means 
to obtain atomic modeling information of the specific amino acids and their atoms forming the 
coactivator binding site and that interact with molecules that bind to the site, such as coactivator. 
The cocrystals also provide modeling information regarding the coactivatonnuclear receptor 
interaction, as well as the structure of coactivators bound thereto. 

30 The present invention further provides methods for identifying and designing small 

molecules that bind to the coactivator binding site using atomic models of nuclear receptors. The 
method involves modeling test compounds that fit spacially into a nuclear receptor coactivator 
binding site of interest using an atomic structural model comprising a nuclear receptor coactivator 
binding site or portion thereof, screening the test compounds in a biological assay characterized by 
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binding of a test compound to a nuclear receptor coactivator binding site, and identifying a test 
compound that modulates coactivator binding to the nuclear receptor. 

The invention also includes compositions and methods for identifying coactivator binding 
sites of nuclear receptors. The methods involve examining the surface of a nuclear receptor of 
interest to identify residues that modulate coactivator binding. The residues can be identified by 
homology to the coactivator binding site of human TR described herein. Overlays and 
superpositioning with a three dimensional model of a nuclear receptor LBD, or a portion thereof 
that contains a coactivator binding site, also can be used for this purpose. Additionally, alignment 
and/or modeling can be used as a guide for the placement of mutations on the LBD surface to 
characterize the nature of the site in the context of a cell. 

Also provided is a method of modulating the activity of a nuclear receptor. The method can 
be in vitro or in vivo. The method comprises administering, in vitro or in vivo, a sufficient amount 
of a compound that binds to the coactivator binding site. Preferred compounds bind to the site with 
greater affinity than coactivator proteins found in a cell of interest. Binding at this site, the 
compound can compete for binding of coactivator proteins, thereby inhibiting gene transcription, or 
in some cases promoting it, even when hormone is or is not bound. 

The invention further includes a method for identifying an agonist or antagonist of 
coactivator binding to a nuclear receptor. The method comprises providing the atomic coordinates 
comprising a nuclear receptor coactivator binding site or portion thereof to a computerized 
modeling system; modeling compounds which fit spacially into the nuclear receptor coactivator 
binding site; and identifying in an assay for nuclear receptor activity a compound that increases or 
decreases activity of the nuclear receptor through binding the coactivator binding site, 

Also provided is a machine-readable data storage medium with information for constructing 
and manipulating an atomic model comprising a coactivator binding site or portion thereof. The 
medium comprises a data storage material encoded with machine readable data which, when using a 
30 machine programmed with instructions for using said data, is capable of displaying a graphical 
three-dimensional representation of a molecule or molecular complex for a nuclear receptor 
coactivator binding site. 

Also provided is a method of identifying a compound that selectively modulates the activity 
of one type of nuclear receptor compared to other nuclear receptors. The method is exemplified by 
35 modeling test compounds that fit spacially and preferentially into a nuclear receptor coactivator 
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5 binding site of interest using an atomic structural model of a nuclear receptor coactivator binding 
site, selecting a compound that interacts with one or more residues of the coactivator binding site 
unique in the context of that site, and identifying in an assay for coactivator binding activity a 
compound that selectively binds to the coactivator binding site compared to other nuclear receptors. 
The unique features involved in receptor-selective coactivator binding can be identified by 

10 comparing atomic models of different receptors or isoforms of the same type of receptor. 

The invention finds use in the selection and characterization of peptide, peptidomimetic, as 
well as other small molecule compounds, such as small organic molecules, identified by the 
methods of the invention, particularly new lead compounds useful in treating nuclear receptor-based 
disorders. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the specific effects of mutations on hTRpl transcriptional activation in 
HeLa cells and correlation with effects on binding to GST-GRIP 1 . T 3 dependent activation of 
transcription of a reporter gene, expressed as the percentage of WT is plotted for each mutant. 
20 GST-GRIP 1 binding, analyzed by autoradiography after separation using 10% SDS-PAGE, was 
also expressed as the percentage of WT and plotted for each mutant. The GST-GRIP 1 used 
included GRIP1 amino acids 721-1 121; the same results were obtained using a GST-GRIP 1 
construct including GRIP1 amino acids 563-1 121 (data not shown). 

Figure 2 shows that overexpression of full-length GRIP1 rescues loss of transcriptional 
25 activation by hTRpl mutants. Indicated amounts of the expression vector for full-length GRIP1, 
pSG5-GRIPl, is included in the cotransfections, which otherwise are performed as in Figure 1. 
The WT or different representative hTRpi mutants are indicated. 

Figure 3 shows specific hERct surface mutants cause loss of transcriptional activation in 
HeLa cells in parallel with their loss of in vitro GRIP1 binding. The fold E 2 activation, expressed as 
30 the percentage of WT, and the phosphorimager quantitation of in vitro binding of [ J>5 S]-labeled 
hERa WT and mutants to GST-GRIP 1 (GRIP1 amino acids 721-1 121) also expressed as the 
percentage of WT is plotted for each mutant. 
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Figure 4 shows a plot of the fold E 2 activation observed when the indicated amounts of the 
ftill-length GRIP1 expression vector, P SG5-GRIP1, are added to the co-transfection experiment, 
which otherwise is performed as for Figure 3. The WT or different hERcc mutants are indicated. 
The data represent the averages of three independent experiments, with standard deviations less 
than 10%. 



Figure 5 shows a CPK model of the TRa-LBD, indicating the LBD surface locations of 
mutations made in the full-length hTR.pi. Mutated residues having no effect on GRIP1 binding or 
effect on activation in HeLa cells are shaded gray. Mutated residues with diminished GRIP1 and 
SRC-la binding and diminished activation in HeLa cells are colored to reflect chemical properties 
of the residues: red, blue (purple), and green indicate acidic, basic, and hydrophobic residues, 
respectively. The main chain structures of the TRa- and TRp-LBDs are the same (data not shown). 

Figure 6 shows sequence alignment of amino acid residues of members of the pl60 
coactivator family. Single amino acid designations are used. Members of the pi 60 coactivator 
family interact with the nuclear receptors through conserved (SEQ ID NO: 1) LxxLL motifs. 

Figure 7 shows binding affinity assays of GST-GRIP1 constructs with NR-boxes 1 2 
and/or 3 and their interaction with TR LBD. GRIP-1 NR boxes 1,2 and 3 interact differently with 
TRP LBD. Single letter designations are used for the amino acids. 

Figure 8 shows binding affinity assays of GST-GRIP1 constructs with NR-boxes 1, 2, 

and/or 3 and their interaction with TR and GR LBD* TR ™* no • *u • • • 

xv aim vjiv l,dl)s. i K. and GR differ in their interactions with 

GRIP-1. 

Figure 9 shows binding affinity assays for NR-box 2- and 3-peptides and GRIP1 and their 
afteraction with TR LBD. NR box 2- and 3-containing peptides reproduce the affinity and 
specificity of the NR interaction domain. 

Figure 10 shows binding affinity assays for NR-box 2- and 3- P e P tides and their interaction 
with TR LBD. Sequence adjacent to the (SEQ ID NO: 1) LxxLL motif modulate the affinity of ■ 
30 NR-box-TRp LBD interactions. 
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Figure 11 shows binding affinity assays for mutant GRIP1 and NR-box 2- and 3-peptides 
and their interaction with TR LBD. The individual leucine residues of the (SEQ ID NO: 1) LxxLL 
motif are crucial for binding of the GRIP-1 NR interaction domain to TRp LBD. 

Figure 12 shows the contents of the asymmetric unit of the crystallized hTRB LBD:GRIP1 
NR-box 2 peptide complex. The crystal lattice consists of a repeating unit containing a 2:2 compl, 
of hTR LBD and GRIP1 site 2 peptide. Positions of the two GRIP1 site 2 peptides are boxed, i, 
green (sitel), and red (site 2), with the peptides drawn as a C-alpha trace. The two NCS related 
monomers of the hTR LBD are shown as a secondary structure ribbon drawing, with monomer 1 in 
light grey, and monomer 2 in dark grey. The side chains of the hydrophobic residues 1689 L690 
L693, L694 of the GRIP1 NR-box 2 peptides are drawn to emphasize those interactions observed' in 
15 both bound peptides. 

Figure 13 shows a ribbon diagram depicting the interaction of the GRIP1 NR-box 2 peptide 
with the hTRB LBD. The GRIP! NR-box 2 peptide (dark grey) forms three turns of a-helix, and 
binds the hTR LBD (light gray) in.a hydrophobic cleft defined by helices H3, H4, H5, and H12 
Portions of the hTRB LBD, and the neighboring monomer, are omitted for clarity. 



Figure 14 shows interface between the GRIP1 NR-box 2 peptide and the hTRB LBD Side 
chains of residues of the hTRB LBD within 4.5A of the GRIP-1 NR-box 2 peptide are labeled The 
color of the mdividual side chains reflects the chemical nature of the residue: acidic residues are 
red, basic residue are blue, aliphatic residues are green, aromatic residues are brown, and polar 
residues are orange. The peptide is depicted as a C-alpha trace with the side chains of (SEQ ID 
25 NO: 2) ILxxLL motif shown explicitly. 

Figure 15 shows residues in the hTRB LBD that are necessary for transaction. The 
transaction mutations are mapped onto the interface between the GRIP1 NR-box 2 peptide and 
the hTRB LBD. 



Figure 16 shows molecular surface of the hTR LBD. The side chains of the leucines resides 
30 fit within a hydrophobic groove formed from helices H3, H5, and HI2. while the side chain of the 
non-conserved isoleucine residue packs against the outside edge of the groove. The remainder of 
the peptide is shown as main chain. 
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5 Figure 17 shows complementarity between the (SEQ ID NO: 1 ) LxxLL motif and the 

surface of the hTR LBD. The side chains of the (SEQ ID NO: 2) ILxxLL motif are shown in a CPK 
representation, with the main chain of the peptide drawn as a C-alpha trace. The three ieucince 
residues fit into pockets of the coactivator binding site of the hTRB LBD, depicted as mesh, while 
the isoleucine residue rests on the edge of the site's cleft. 

10 Figure 18 shows the coactivator binding site cleft, one side of which is formed by 

conformationally hormone-responsive residues. On the left is a view of the TR-LBD molecular 
surface showing the concave surfaces in gray . The cavity is shown at the center of the figure. On 
the right is shown a CPK model of the TR-LBD, overlaid with a molecular surface view, which is 
restricted to a 12A radius of the hydrophobic cavity. Mutated residues of the coactivator binding 

15 site that are hormone-insensitive (V284, K288, 1302 and K306) are located on one side of the cleft 
and are colored yellow. Mutated CBS residues likely undergo a conformational change upon 
hormone binding (L454 and E457) are located on the opposite side of the cleft and are colored red. 

Figure 19 shows alignment of amino acid sequences (single letter amino acid designations) 
containing residues that form the coactivator binding sites of several nuclear receptors. The boxes 
20 represent residues of alpha-helix (H3, H4, H5, H6 and HI 2); lower case letters "h" and "q" 
represent hydrophobic and polar residues, respectively. 
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DESCRIPTI ON OF SPECIFIC EMBODIMENTS 

The present invention provides methods and compositions for identifying compounds that 
modulate nuclear receptor activity. The compounds can be nuclear receptor agonists or antagonists 
that bind to the coactivator binding site (and that act as mimetics to the coactivator in this regard) 
and promote (agonists) or block (antagonists) binding of the coactivator to the target nuclear 
receptor. Compounds that bind to the coactivator binding site also are provided. The compounds 
can be natural or synthetic. Preferred compounds are small organic molecules, peptides and 
peptidomimetics (e.g., cyclic peptides, peptide analogs, or constrained peptides). 

As described in the Examples, mutagenesis and coactivator binding studies, coupled with 
analysis of atomic models derived from cocrystals, reveals for the first time a previously unknown 
structure for nuclear receptors, the coactivator binding site. By "coactivator binding site" is 
intended a structural segment or segments of nuclear receptor polypeptide chain folded in such a 
way so as to give the proper geometry and amino acid residue conformation for binding a 
coacnvator. This is the physical arrangement of protein atoms in three-dimensional space forming a 
coactivator binding site pocket or cavity. Residues forming the site are amino acids corresponding 
to (i.e., the same as or equivalent to) human TR residues of C-terminal helix 3 (Ile280 Thr281 
Val283, Val284, Ala287, and Lys288), helix 4 (Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306)' 
hehx 6 (Cys309), and helix 12 (Leu454, Glu457, Val458 and Phe459). The coactivator binding site 
.s highly conserved among the nuclear receptor super family (Figure 19). Thus, this site 
corresponds to a surprisingly small cluster of residues on the surface of the LBD that form a 
promment hydrophobic cleft. The hydrophobic cleft is formed by hydrophobic residues 
corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283, Val284, and Ala287) 
hehx 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 (Leu454, Val458 and 
Phe459). The hydrophobic cleft of the coactivator binding site also is highly conserved among the 
nuclear receptor super family (Figure 19). 

The invention also includes compositions and methods for identifying coactivator binding 
sites of nuclear receptors. The methods involve examining the surface of a nuclear receptor of 
interest to identify residues that modulate coactivator binding. The residues can be identified by 
homology to the coactivator binding site of human TR described herein. A preferred method is 
alignment with the residues of any nuclear receptor corresponding to (i.e., equivalent to) human TR 



WO 99/60014 1Q P<^T/US99/06899 -_ 

5 residues of the C-terminal helix 3 (Ile280, Thr281, Val283, Val284, Ala287, and Lys288), helix 4 
(Phe293), helix 5 (Gih30 1 ? Ile302; Leu305, Lys306), helix 6 (Cys309), and helix 12 (Pro453, 
Leu454, Glu457, Val458 and Phe459). Overlays and superpositioning with a three-dimensional 
model of a nuclear receptor LBD, or a portion thereof that contains a coactivator binding site, also 
can be used for this purpose. For example, three-dimensional structures of TR, RAR, RXR and ER 

10 LBDs can be used for this purpose. For example, nuclear receptors identifiable by homology 
alignment include normal nuclear receptors or proteins structurally related to nuclear receptors 
found in humans, natural mutants of nuclear receptors found in humans, normal or mutant receptors 
found in animals, as well as non-mammalian organisms such as pests or infectious organisms, or 
viruses* 

15 Alignment and/or modeling also can be used as a guide for the placement of mutations on 

the LBD surface to characterize the nature of the site in the context of a cell. Selected residues are 
mutated to preserve global receptor structure and solubility. To destroy the coactivator binding 
interaction, preferred mutations are to charged residues (e.g., Arg, Lys, or Glu) on the basis that 
bulky, surface charged residues might disrupt coactivator binding, yet preserve global receptor 

20 structure and solubility. Mutants can be tested for coactivator binding as well as the relative change 
in strength of the binding interaction. Ligand-dependent coactivator interaction assays also can be 
tested for this purpose, such as those described herein. 

Compounds that bind to the coactivator binding site of nuclear receptors can be identified by 
computational modeling and/or screening. For example, coactivator agonists or antagonists can be 

25 identified by providing atomic coordinates comprising a nuclear receptor coactivator binding site or 
portion thereof to a computerized modeling system, modeling them, and identifying compounds that 
fit spacially into the coactivator binding site. By a "portion thereof is intended the atomic 
coordinates corresponding to a sufficient number of residues or their atoms of the coactivator 
binding site that interact with a compound capable of binding to the site. This includes receptor 

30 residues having an atom within 4.5A of a bound compound or fragment thereof. For instance, 
human TR residues V284, Phe293, Ile302, Leu305 and Leu454 contain side chain atoms that are 
within 4.5 A, and interact with, hydrophobic residues of a (SEQ ID NO: 1) LxxLL motif of an NR- 
box 2 coactivator peptide. As another example, an atomic structural model utilized for 
computational modeling and/or screening of compounds that bind to the coactivator binding site 

35 may include a portion of atomic coordinates of amino acid residues corresponding to the site 

composed of residues of human thyroid receptor selected from Val284 ? Lys288, Ile302, Lys306, 
Leu454 and Glu457. or their structural and functional equivalents found in other receptors. Thus, 
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for example, the atomic coordinates provided to the modeling system can contain atoms of the 
nuclear receptor LBD, part of the LBD such as atoms corresponding to the coactivator binding site 
or a subset of atoms useful in the modeling and design of compounds that bind to a coactivator 

binding site. 

The atomic coordinates of a compound that fits into the coactivator binding site also can be 
used for modeling to identify compounds or fragments that bind the site. By "modeling" is 
intended quantitative and qualitative analysis of molecular structure/function based on atomic 
structural information and receptor-coactivator agonists/antagonists interaction models. This 
includes conventional numeric-based molecular dynamic and energy minimization models, 
mteractive computer graphic models, modified molecular mechanics models, distance geometry and 
other structure-based constraint models. Modeling is preferably performed using a computer and 
may be further optimized using known methods. By "fits spacially" is intended that the three- 
dimensional structure of a compound is accommodated geometrically by a cavity or pocket of a 
nuclear receptor coactivator binding site. 

Compounds of particular interest fit spacially and preferentially into the coactivator binding 
site. By "fits spacially and preferentially" is intended that a compound possesses a three- 
dimensional structure and conformation for selectively interacting with a nuclear receptor 
coactivator binding site. Compounds that fit spacially and preferentially into the coactivator 
bmdmg site interact with amino acid residues forming the hydrophobic cleft of this site. In 
particular, the hydrophobic cleft of the coactivator binding site comprises a small cluster of 
hydrophobic residues. The site also contains polar or charged residues at its periphery. The present 
invention also includes a method for identifying a compound capable of selectively modulating 
coactivator binding to different nuclear receptors. The method comprises the steps of modeling test 
compounds that fit spacially and preferentially into the coactivator binding site of a nuclear receptor 
of interest using an atomic structural model of a nuclear receptor, screening the test compounds in a 
biological assay for nuclear receptor activity characterized by preferential binding of a test 
compound to the coactivator binding site of a nuclear receptor, and identifying a test compound that 
selectively modulates the activity of a nuclear receptor. Such receptor-specific compounds are 
selected that exploit differences between the coactivator binding sites of one type of receptor versus 
a second type of receptor, such as the differences depicted in Figure 19. 

The invention also is applicable to generating new compounds that distinguish nuclear 
receptor isoforms. This can facilitate generation of either tissue-specific or function-specific 
compounds. For instance, GR subfamily members have usually one receptor encoded by a single 
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5 gene, although there are exceptions. For example, there are two PR isoforms, A and B, translated 
from the same mRNA by alternate initiation from different AUG codons. There are two GR forms, 
one of which does not bind ligand. This method is especially applicable to the TR subfamily which 
usually has several receptors that are encoded by at least two (TR: a, P) or three (RAR, RXR, and 
PPAR: a, P, y) genes or have alternate RNA splicing. 

10 The receptor-specific compounds of the invention preferably interact with conformationally 

constrained residues of the coactivator binding site that are conserved among one type of receptor 
compared to a second type of receptor. "Conformationally constrained" is intended to refer to the 
three-dimensional structure of a chemical or moiety thereof having certain rotations about its bonds 
fixed by various local geometric and physical-chemical constraints. Conformationally constrained 

15 structural features of a coactivator binding site include residues that have their natural flexible 
conformations fixed by various geometric and physical-chemical constraints, such as local 
backbone, local side chain, and topological constraints. These types of constraints are exploited to 
restrict positioning of atoms involved in receptor-coactivator recognition and binding. 

For instance, comparison of sequences of the GR and TR coactivator interaction surface 

20 shows a highly negatively charged sequence at the C-terminal end of TR helix 12 (E460 and D461) 
that is neutral in the equivalent positions in GR helix 12 (GR residues T788 and N759, 
corresponding to TR residue positions 460 and 461, as depicted in Figure 19). As described in the 
Examples, the cocrystal of the hTRB LBD complexed with the GRIP1 NR-box 2 peptide shows that 
TR residues E460 and D461 interact with positively charged residues of the NR-box 2 peptide. 

25 Also, when comparing the RAR LBD structure to that of the TR LBD, conformation of helix 12 
differs slightly, whereas helices 3, 4, 5 and 6 are substantially the same. Thus, differences in helix 
12, particularly charge differences at the C-terminal end of the helix, may modulate preferential 
interaction of TR for NR-box 2 containing coactivators. As further demonstrated in the Examples, 
TR and GR differ in their specificity for different NR-boxes containing the conserved (SEQ ID NO: 

30 1) LxxLL motif found in members of the pi 60 family of coactivator proteins. As also demonstrated 
in the Examples, GR but not TR is able to interact with peptides containing the hydrophobic 
interaction motifs of p53 (SEQ ID NO: 3; FxxLW) and VP16 (SEQ ID NO: 4; FxxAL). Thus, TR 
exhibits preferential interaction with NR-box peptides comprising the (SEQ ID NO: 1) LxxLL 
motif, but GR does not discriminate and can bind peptides containing a generic amphipathic helix 

35 motif. Accordingly, these real differences among the various nuclear receptors can be exploited in 
the identification and design of compounds that modulate coactivator binding to one nuclear 
receptor compared to another. 
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For modeling, docking algorithms and computer programs that employ them can be used to 
identify compounds that fit into the coactivator binding site. For example, docking programs can be 
used to predict how a small molecule of interest can interact with the nuclear receptor coactivator 
binding site. Fragment-based docking also can be used in building molecules de novo inside the 
coactivator binding site, by placing chemical fragments that complement the site to optimize 
intermodular interactions. The techniques can be used to optimize the geometry of the binding 
interactions. This design approach has been made possible by identification of the coactivator 
binding site structure thus, the principles of molecular recognition can now be used to design a 
compound which is complementary to the structure of this site. Compounds fitting the coactivator 
binding site serve as a starting point for an iterative design, synthesis and test cycle in which new 
compounds are selected and optimized for desired properties including affinity, efficacy, and 
selectivity. For example, the compounds can be subjected to addition modification, such as 
replacement and/or addition of R-group substituents of a core structure identified for a particular 
class of binding compounds, modeling and/or activity screening if desired, and then subjected to 
additional rounds of testing. 

Computationally small molecule databases can be screened for chemical entities or 
compounds that can bind in whole, or in part, to a nuclear receptor coactivator binding site of 
interest. In this screening, the quality of fit of such entities or compounds to the binding site may be 
judged either by shape complementarity (DesJalais et al., J. Med. Chem. (1988) 57:722-729) or by 
estimated interaction energy (Meng et al., J. Camp. Chem. (1992) 7J:505-524). The molecule 
databases include any virtual or physical database, such as electronic and physical compound 
library databases, and are preferably used in developing compounds that modulate coactivator 
. binding. 

Compounds can be designed intelligently by exploiting available structural and functional 
information by gaining an understanding of the quantitative structure-activity relationship (QSAR), 
using that understanding to design new compound libraries, particularly focused libraries having 
chemical diversity of one or more particular groups of a core structure, and incorporating any 
structural data into that iterative design process. For example, one skilled in the art may use one of 
several methods to screen chemical entities or fragments for their ability to associate with the 
coactivator binding site of a nuclear receptor of interest. This process may begin by visual 
inspection of, for example, the coactivator binding site on the computer screen. Selected fragments 
or chemical entities may then be positioned into all or pan of the site. Docking may be 
accomplished using software such as Quanta and Sybyl, followed by energy minimization and 
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5 molecular dynamics with standard molecular mechanics force-fields, such as CHARMM and 
AMBER. 

For example, compounds and/or fragments can be designed to fill up the hydrophobic cleft, 
the pocket deep within the cleft, the upper end of the site, and/or the lower end of the site. Residues 
comprising a coactivator binding site, when defined by the user as those residues having an atom 

10 within 4.5 A of an atom of a bound chemical entity, can be modeled to look for energetic 
contributions and interaction with the bound chemical entity. For example, a compound or 
fragment can be designed to contain hydrophobic groups that interact with hydrophobic residues of 
the coactivator binding site. As described in the examples, human TR residues V284, Phe293, 
Ile302, Leu305 and Leu454 contain side chain atoms that are within 4.5A, and interact with, 

15 hydrophobic residues of a (SEQ ID NO: 1) LxxLL motif of an NR-box 2 coactivator peptide. Thus, 
for example, peptides and/or peptide mimetics having a hxxhh motif, where "h" is a hydrophobic 
residue and x is any residue, can be constructed. Small organic molecules that mimic one or more 
of these particular interactions also can be designed, for example, by including one or more R- 
groups that are hydrophobic and fit into the site. 

20 Specialized computer programs may also assist in the process of selecting chemical entity 

fragments or whole compounds. These include: GRID (Goodford, J. Med Chem. (1985) 25:849- 
857; available from Oxford University, Oxford, UK); MCSS (Miranker et al., Proteins: Structure, 
Function and Genetics^ (1991) 77:29-34; available from Molecular Simulations, Burlington, MA); 
AUTODOCK (Goodsell et al., Proteins: Structure, Function and Genetics (1990) 5:195-202; 

25 available from Scripps Research Institute, La Jolla, CA); and DOCK (Kuntz et al, J. Mol BioL 
(1982) 757:269-288; available from University of California, San Francisco, CA). 

Additional commercially available computer databases for small molecular compounds 
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem. Des. Auto. 
News (1993) 5:44-47). 

30 Once suitable chemical entities or fragments have been selected, they can be assembled into 

a single compound. Assembly may be proceeded by visual inspection of the relationship of the 
fragments to each other on the three-dimensional image displayed on a computer screen in relation 
to the structure coordinates of a nuclear receptor. This can be followed by manual model building 
using software such as Quanta or SybyL 
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5 Useful programs to aid one of skill in the art in connecting the individual chemical entities 

or fragments include: CAVEAT (Bartlett et al., "CAVEAT: A Program to Facilitate the Structure- 
Derived Design of Biologically Active Molecules", In: Molecular Recognition in Chemical and 
Biological Problems", Special Pub., Royal Chem. Soc. (1989) 75:182-196; CAVEAT is available 
from the University of California, Berkeley, CA); 3D Database systems such as MACCS-3D (MDL 
10 Information Systems, San Leandro, CA; reviewed in Martin, J. Med Chem. (1992) 35:2145-2154); 
and HOOK (available from Molecular Simulations, Burlington, MA). 

In addition to building a compound in a step-wise fashion, one fragment or chemical entity 
at a time as described above, compounds that bind to a coactivator binding site of interest also may 
be designed as a whole or de novo using either an empty coactivator binding site or optionally 
1 5 including some portion(s) of a molecule known to binds to the site, such as an NR-box type peptide. 
These methods include: LUDI (Bohm, J. Comp. Aid. Molec. Design (1992) 6:61-78; LUDI is 
available from Biosym Technologies, San Diego, CA); LEGEND (Nishibata et al., Tetrahedron 
(1991) 47:8985; LEGEND is available from Molecular Simulations, Burlington, MA); and 
LeapFrog (available from Tripos Associates, St. Louis, MO). 

20 Other molecular modeling techniques may also be employed in accordance with this 

invention. See, for example, Cohen et al., J. Med. Chem. (1990) 55:883-894); Navia et al., Curr. 
Opin. Struct. Biol. (1992) 2:202-210). For example, where the structures of test compounds are 
known, a model of the test compound may be superimposed over the model of the structure of the 
invention. Numerous methods and techniques are known in the art for performing this step, any of 

25 which may be used. See, for example, Farmer, "Drug Design," Aliens, EJ., ed., 10:119-143 
(Academic Press, New York, 1980); U.S. Patent No. 5,331,573; U.S. Patent No. 5,500,807; 
Verlinde, Structure, (1994) 2:577-587); and Kuntz et al., Science, (1992) 257:1078-1082). The 
model building techniques and computer evaluation systems described herein are not a limitation on 
the present invention. 

30 Using these computer modeling systems a large number of compounds may be quickly and 

easily examined and expensive and lengthy biochemical testing avoided. Moreover, the need for 
actual synthesis of many compounds can be substantially reduced and/or effectively eliminated. 

Compounds identified through modeling can be screened in an assay characterized by 
binding of the compound to a coactivator binding site of interest for coactivator binding activity, 
35 such as a biologically based assay. Screening can be in vitro and/or in vivo. Preferred assays 
include cell-free competition assays and cell culture based assays. The biological screening 
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preferably centers on activity-based response models, binding assays (which measure how well a 
compound binds to the receptor), and bacterial, yeast and animal cell lines (which measure the 
biological effect of a compound in a cell). The assays can be automated for high capacity - high 
throughput screening (HTS) in which large numbers of compounds can be tested to identify 
compounds with the desired activity. 



As an example, in vitro binding assays can be performed in which compounds are tested for 
their ability to block the binding of a coactivator protein, fragment, fusion or peptide thereof, to a 
coactivator binding site of interest. For cell and tissue culture assays, they may be performed to 
assess a compound's ability to block function of cellular coactivators, such as members of the pl60 
family of coactivator proteins, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues 
15 TIF 2 and NcoA-2, and those that exhibit receptor and/or isoform-specific binding affinity. In a 
preferred embodiment, compounds of the invention bind to a nuclear receptor coactivator binding 
site with greater affinity than the cellular coactivator proteins. Tissue profiling and appropriate 
animal models also can be used to select compounds. Different cell types and tissues also can be 
used for these biological screening assays. Suitable assays for such screening are described herein 
and in Shibata et al. (Recent Prog. Horm. Res. 52: 141 -164 (1997)); Tagami et al. (Mol. Cell Biol. 
(1997) 77(5):2642-2648); Zhu et al. (J. Biol. Chem. (1997) 272(14):9048-9054); Lin et aL (Mol. 
Cell Biol. (1997) /7(10):6131-6138); Kakizawa et aL (J. Biol. Chem. (1997) 272(38):23799-23804); 
and Chang et al. (Proc. Natl. Acad. Sci. USA (1997) P<17):9040-9045), which references are 
incorporated herein in their entirety by reference. For example, coactivators or binding fragments 
25 thereof can be expressed and/or assayed for binding as for GRIP1 (Hong et al., MCB supra : and 
Hong et al., PNAS supra) and/or SRC-1 (Spencer et al., Nature (1997) 389: 194-1 98; Onate et al., 
Science (1995) 270:1354-1357), incorporated by reference. 

The compounds selected can have agonist and/or antagonistic properties. The compounds 
also include those that exhibit new properties with varying mixtures of agonist and antagonist ' 

30 activities, depending on the effects of altering coactivator binding in the context of different 
activities of nuclear receptors, either hormone-dependent or hormone-independent, which are 
mediated by proteins other than coactivators, and which interact with the receptors at locations 
other than the coactivator binding site. The compounds also include those, which through their 
binding to receptor locations that are conformational ly sensitive to hormone binding, have allosteric 

35 effects on the receptor by stabilizing or destabilizing the hormone-bound conformation of the 
receptor, or by directly inducing the same, similar, or different conformational changes induced in 
the receptor by the binding of hormone. 
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5 Of particular interest is use of such compounds in a method of modulating nuclear receptor 

activity in a mammal by administering to a mammal in need thereof a sufficient amount of a 
compound that fits spatially and preferentially into a coactivator binding site of a nuclear receptor 
of interest. By "modulating" is intended increasing or decreasing activity of a nuclear receptor. For 
example, pre-clinical candidate compounds can be tested in appropriate animal models in order to 
10 measure efficacy, absorption, pharmacokinetics and toxicity following standard techniques known 
in the art. Compounds exhibiting desired properties are then tested in clinical trials for use in 
treatment of various nuclear receptor-based disorders. These include ER-based disorders, such as 
postmenopausal symptoms and cancer resulting from loss of estrogen production, and osteoporosis 
and cardiovascular disease stemming from traditional estrogen replacement therapy. Others include 
TR-based disorders including cardiovascular disease, metabolic disorders, hyperthyroidism, 
glaucoma and skin disorders. GR-based disorders include Type II diabetes and inflammatory 
conditions such as rheumatic diseases. 

The invention also provides for cocrystals made from nuclear receptor ligand binding 
domains with a molecule bound to the coactivator binding site. As exemplified in the Examples, 
TR LBDs are co-crystallized with a peptide molecule comprising a coactivator NR-box 2 peptide 
sequence bound to the coactivator binding site, and the hormone/ligand T 3 . 

Crystals are made from purified nuclear receptor LBDs that are usually expressed by a cell 
culture, such as E. coli. Preferably, different crystals (cocrystals) for the same nuclear receptor are 
separately made using different coactivators-type molecules, such as protein fragments, fusions or 
small peptides. The coactivator-type molecules preferably contain NR-box sequences necessary for 
binding to the coactivator binding site, or derivatives of NR-box sequences. Other molecules can 
be used in co-crystallization, such as small organics that bind to the coactivator or hormone binding 
site(s). Heavy atom substitutions can be included in the LBD and/or a co-crystallizing molecule. 

After the three dimensional structure of the cocrystal is determined, the structural 
information can be used in computational methods to design synthetic compounds for the nuclear 
receptor, and further structure-activity relationships can be determined through routine testing using 
the assays described herein and known in the art. 

Since nuclear receptor LBDs may crystallize in more than one crystal form, the structure 
coordinates of such receptors or portions thereof, as provided in Appendix 1, are particularly useful 
for solving the structure of those other crystal forms of nuclear receptors. They may also be used to 
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5 solve the structure of mutants or co-complexes of nuclear receptors having sufficient structural 
similarity. 

One method that may be employed for this purpose is molecular replacement. In this 
method, the unknown crystal structure, may be determined using the structure coordinates of this 
invention as provided in Appendix 1. This method will provide an accurate structural form for the 
10 unknown crystal more quickly and efficiently than attempting to determine such information ab 
initio. 

Atomic coordinate information gleaned from the crystals of the invention can be stored. In a 
preferred embodiment, the information is provided in the form of a machine-readable data storage 
medium. This medium contains information for constructing and/or manipulating an atomic model 

15 of a coactivator binding site or portion thereof. For example, the machine readable data for the 
coactivator binding site comprises structure coordinates of amino acids corresponding to human TR 
amino acids selected from C-terminal helix 3 (Ile280, Thr281, Val283, Val284, Ala287, and 
Lys288), helix 4 (Phe293), helix 5 <Gln301, Ile302, Leu305, Lys306), helix 6 (Cys309), and helix 
12 (Pro453, Leu454, Glu457, W1458 and Phe459), or a homologue of the molecule or molecular 

20 complex comprising the site. The homologues comprise a coactivator binding site that has a root 
mean square deviation from the backbone atoms of the amino acids of not more than 1.5 A. A 
preferred molecule or complex represents a compound bound to the coactivator binding site. 

The machine-readable data storage medium can be used for interative drug design and 
molecular replacement studies. For example, a data storage material is encoded with a first set of 

25 machine-readable data that can be combined with a second set of machine-readable data. For 
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion of 
the structural coordinates of the nuclear receptor or portion thereof of interest, and the second data 
set comprises an X-ray diffraction pattern of the molecule or molecular complex of interest. Using 
a machine programmed with instructions for using the first and second data sets a portion or all of 

30 the structure coordinates corresponding to the second data can be determined. 

Protein for crystals and assays described herein can be produced using expression and 
purification techniques described herein and known in the art. For example, high level expression 
of nuclear receptor LBDs can be obtained in suitable expression hosts such as E. coli. Expression 
of LBDs in E. coli, for example, includes the TR LBD and other nuclear receptors, including 
35 members of the steroid/thyroid receptor superfamily, such as the receptors ER. AR, MR, PR, RAR, 
RXR and VDR. Yeast and other eukaryotic expression systems can be used with nuclear receptors 
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5 that bind heat shock proteins as these nuclear receptors are generally more difficult to express in 
bacteria, with the exception of ER, which can be expressed in bacteria. Representative nuclear 
receptors or their ligand binding domains have been cloned and sequenced: human RAR-a, human 
RAR-y, human RXR-cc, human RXR-a, human PPAR-a, human PPAR-0, human PPAR-y, human 
VDR, human ER (as described in Seielstad et ai 9 Molecular Endocrinol, (1995) 9:647-658, 
1 0 incorporated herein by reference), human GR, human PR, human MR, and human AR. The LBD 
for each of these receptors has been identified. 

Coactivator proteins can be expressed using techniques known in the art, particularly 
members of the pi 60 family of coactivator proteins that have been cloned and/or expressed 
previously, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues TIF 2 and NcoA-2. 
15 A preferred method for expression of coactivator protein is to express a fragment that retains 
transcriptional activation activity using the "yzast 2-hybrid" method as described by Hong et ai. 
(PNAS supra: and MCB suvrd ). for GRIP 1 expression, which reference is herein incorporated by 
reference. 

The proteins can be expressed alone, as fragments of the mature or full-length sequence, or 
20 as fusions to heterologous sequences. For example, TR can be expressed without any portion of the 
DBD or amino-terminal domain. Portions of the DBD or amino-terminus can be included if further 
structural information with amino acids adjacent the LBD is desired. Generally, for the TR the 
LBD used for crystals will be less than 300 amino acids in length. Preferably, the TR LBD will be 
at least 150 amino acids in length, more preferably at least 200 amino acids in length, and most 
25 preferably at least 250 amino acids in length. For example the LBD used for crystallization can 
comprise amino acids spanning from Met 122 to Val 410 of the rat TR-a or Glu 202 to Asp 461 of 
the human TR-p. 

Typically the LBDs are purified to homogeneity for crystallization. Purity of LBDs can be 
measured with sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), mass 
30 spectrometry (MS) and hydrophobic high performance liquid chromatography (HPLC). The 
purified LBD for crystallization should be at least 97.5 % pure, preferably at least 99.0% pure, more 
preferably at least 99.5% pure. 

Initially, purification of the unliganded receptor can be obtained by conventional techniques, 
such as hydrophobic interaction chromatography (HPLC), ion exchange chromatography (HPLC), 
35 and heparin affinity chromatography. 
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To achieve higher purification for improved crystals of nuclear receptors, especially the TR 
subfamily and TR, the receptors can be ligand-shift-purified using a column that separates the 
receptor according to charge, such as an ion exchange or hydrophobic interaction column, and then 
bind the eluted receptor with a ligand, especially an agonist. The ligand induces a change in the 
receptor's surface charge such that when re-chromatographed on the same column, ligand-bound 
receptor is separated from unliganded receptor. Usually saturating concentrations of ligand are used 
in the column and the protein can be preincubated with the ligand prior to passing it over the 
column. The structural studies detailed herein indicate the general applicability of this technique 
for obtaining super-pure nuclear receptor LBDs for crystallization. 

Purification can also be accomplished by use of a purification handle or "tag," such as with 
at least one histidine amino acid engineered to reside on the end of the protein, such as on the N- 
terminus, and then using a nickel or cobalt chelation column for purification. (Janknecht et aL, 
Proc. Natl Acad. Set USA, (1991) 55:8972-8976) incorporated by reference. 

Typically purified LBD, such as TR LBD, is equilibrated at a saturating concentration of 
ligand at a temperature that preserves the integrity of the protein. Ligand equilibration can be 
established between 2 and 37°C, although the receptor tends to be more stable in the 2-20°C range. 
Preferably crystals are made with the hanging drop methods detailed herein. Regulated temperature 
control is desirable to improve crystal stability and quality. Temperatures between 4 and 25°C are 
generally used and it is often preferable to test crystallization over a range of temperatures. The 
crystals are then subjected to vapor diffusion and bombarded with x-rays to obtain x-ray diffraction 
pattern following standard procedures. 

For co-crystallization with a peptide that binds to the coactivator binding site, various 
concentrations of peptides containing a sequence that binds to a coactivator binding site of a nuclear 
receptor of interest can be used in-microcrystallization trials, and the appropriate peptides selected 
for further crystallization. Any number of techniques, including those assays described herein can 
assay peptides for binding to the coactivator binding site of a nuclear receptor of interest. In a 
preferred embodiment, a NR-box 2 sequence-containing peptide is used for crystallization with TR 
LBD. A preferred peptide contains a NR-box (SEQ ID NO: 1) LxxLL motif, and suitable flanking 
sequences necessary for binding and forming complex with coactivator binding site of the nuclear 
receptor of interest, such as a TR LBD. The binding peptides are then tested in crystallization trials 
at various concentrations and ratios of concentrations with a nuclear receptor of interest, for 
example, as described herein and in the Examples. For crystallization trials with TR LBD, the 
hanging drop vapor diffusion method is preferred. Conditions of pH, solvent and solute 



WO 99/60014 

21 PCT/US99/06899 -_ 

5 components and concentrations and temperature can be adjusted, for instance, as described in the 
Examples. In the handing drop method, to obtain suitable crystals for x-ray diffraction analysis, 
seeding of prepared drops with microcrystals of the complex can be used. Collection of structural 
information can be determined by molecular replacement using the structure of TR LBD 
determined herein or previously by Wagner et al., supr^ The structure is refined following 
1 0 standard techniques known in the art. 

There are many uses and advantages provided by the present invention. For example the 
methods and compositions described herein are useful for identifying peptides, peptidomimetics or 
small natural or synthetic organic molecules that modulate nuclear receptor activity The 
compounds are useful in treating nuclear receptor-based disorders. Methods and compositions of 
15 the mvention also find use in characterizing structure/fimction relationships of natural and synthetic 
coactivator compounds. 

The following examples illustrate various aspects of this invention. TTiese examples do not 
limit the scope of this invention. 
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EXAMPLES 

Example 1: Expression and purification of wild-type and mutant nuclear receptors and 
coactivators 

A. Human TRB LBD 

Human TRB LBD (His6-E202-D461) was expressed and purified as described (Shiau et al., 
Gene (1996) 1 79(2) :205- 10). Briefly, the protein was expressed from pET (e.g., pET3 and pET28) 
in BL21DE3 at 14°C, induced at OD(600nm) 0.7 with ImM IPTG and incubation was extended for 
24 hours. Cells were harvested and lysed in 50mM sodium-phosphate buffer (pH 8.0), 0.3M NaCl, 
10% glycerol, 25mM B-merceptoethanol and O.lmM PMSF as described above. The lysate was 
cleared by ultracentrifiigation (Ti45 ? 36000 rpm, lh, 4°C), loaded on a Talon column equilibrated in 
the sodium phosphate buffer described above, washed with 12mM imidazole and eluted with an 
imidazole gradient (12 - 300 mM)i ' TRB LBD containing fractions were loaded in 0.6M ammonium 
sulfate on a TSK-phenyl hydrophobic interaction column and eluted with a reverse ammonium 
sulfate gradient [0.6 - 0 M] in 50% glycerol and 10% acetonitrile. Fractions containing TRB LBD 
were tested for hormone binding, pooled and incubated with a 3-fold molar excess of T3 (Sigma). 
The hydrophobic interaction run was repeated with liganded receptor under the same conditions. 
Liganded receptor, which elutes earlier than unliganded receptor, was collected and buffer changed 
to 20mM Hepes pH7.0, 3mM DTT and 0. 1 |aM T 3 using NAP columns (Pharmacia). For 
crystallization, the protein was concentrated by ultrafiltration (Millipore UFV2BGC10 
concentrators) to a final concentration of 9mg/ml. The yield was about 9.5mg protein per liter 
bacterial culture. 

B. Human TR mutants 

Thirty-seven thyroid receptor mutants were created by synthesizing double-stranded 
oligonucleotides which encode the mutant sequence and which have ends allowing them to be 
ligated as a cassette using pairs of the Nsil, PstI, SstL AlwNI, ApoL PflMI, BstXL BseRI, BsmFI, 
PvuII, Nspl, Smal, Pmll, BglH and Bsml restriction sites of the hTRpl cDNA sequence, or the 3' 
plasmid polylinker Sail, or BamHI restriction sites. The hTRpl sequences thus mutated were 
subcloned into the pCMX vector encoding the full-length 461 amino acid hTRpi sequence. Some 
of the mutations of the hTRpi in the CMX vector and all three mutations of the hERct in the pSG5- 
ER-HEGO vector (Tora et al., EMBO (1 989) 8: 198 1 ) were created using Quick Change Site- 
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Directed Mutagenesis Kits (Stratagene). The mutated sequences were verified by DNA sequencing 
using Sequenase Kits (Stratagene). 

C. Human ERnr I Rn 

The human ERcc-LBD 297-554 was overexpressed as described previously (Seielstad, et al., 
suEra) in BL21(DE3)pLysS cells transformed with a modified pET-23d-ERG vector that contained 
the sequence Met-Asp-Pro fused to residues 297 to 554 of the hERa (provided by Paul Sigler of 
Yale University). Clarified bacterial lysates were adjusted to 3 M in urea and 0.7 M in NaCl and 
then applied to a 10-ml column of estradiol-Sepharose (Greene, et al., Proc. Natl. Acad. Sci. USA 
(1980) 77:51 15-5119; Landel, et al., Mol. Endocrinol. (1994) 5:1407-1419; Landel, etaL, J. Steroid 
Biochem. Molec. Biol. (1997) 6"5:59-73). 



To carboxymethylate the solvent-accessible cysteines, the bound hERa-LBD was treated 
with 5 mM iodoacetic acid in 10 mM Tris, pH 8.1, 250 mM NaSCN (Hegy, et al., Steroids (1996) 
57:367-373). Protein was eluted with 3 x 10-5 M ligand (either DES or OHT) in 30-100 ml of 50 
mM Tris, 1 mM EDTA, 1 mM DTT and 250 mM NaSCN, pH 8.5. The yield of hERa-LBD was 
typically close to 100% (Seielstad, et al., Biochemistry (1995) 5*12605-12615). The affimty- 
purified material was concentrated and exchanged into 20 mM Tris, 1 mM EDTA, 4 mM DTT, pH 
8.1 by ultrafiltration. The protein-was bound to a Resource Q column (Pharmacia) and then eluted 
with a linear gradient of 25-350 mM NaCl in 20 mM Tris, pH 8.1, 1 mM DTT. The hERce-LBD- 
ligand complexes eluted at 150-200 mM NaCl. Pooled fractions were concentrated by 
ultrafiltration and analyzed by SDS-PAGE, native PAGE, and electrospray ionization mass 
25 spectrometry. 

D. Human ER mutants 

To test the importance of the NR box peptide/LBD interface observed in the crystal, a series 
of site-directed mutations were introduced into the ERcc LBD. These mutations were designed 
either to simultaneously perturb the structural integrity and the nonpolar character of the floor of the 
binding groove (He 358->Arg, Val 376->Arg and Leu 539->Arg) or to prevent the formation of the 
capping interactions (Lys 362->Ala and Glu 542->Lys). Fusions of glutathione-S-transferase 
(GST) to the wild-type and mutant LBDs were analyzed for their ability to bind 35 S -labeled GRIP1 
in the absence of ligand or in the presence of DES or OHT. 
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5 35 S-labeled GRIP1 was incubated with either immobilized GST, immobilized wild type 

GST-hERa LBD, or immobilized mutant GST-LBDs in the absence of ligand or in the presence of 
DES or OHT. The bound GRIP1 was quantitated after SDS-PAGE. I358R, mutant LBD 
containing a Ile->Arg substitution at residue 358; K362A, mutant LBD containing a Lys->Ala 
substitution at residue 362; V376R, mutant LBD containing a Val->Arg substitution at residue 376; 

10 L539R, mutant LBD containing a Leu->Arg substitution at residue 539; E542K, mutant LBD 
containing a Glu->Lys substitution at residue 542. 

In the absence of ligand or in the presence of OHT, fusions to the wild-type protein and all 
of the mutant LBDs showed no detectable binding to GRIP1. The He 358->Arg, Val 376->Arg and 
Leu 539->Arg mutants were all unable to interact with coactivator in the presence of agonist, 
15 confirming the importance of the packing interactions observed in the crystal. Disruption of either 
the N- or C-terminal capping interaction also compromised GRIP1 binding in the presence of 
agonist. Only the wild-type GST-LBD was able to recognize the coactivator in the presence of 
DES. 

E. Human ER LBD-GST Fusion Protein 

20 A fusion between glutathibrie-S-transferase (GST) and amino acids 282-595 of hERa was 

constructed by subcloning the EcoRI fragment from pSG5 ERa-LBD (Lopez et aL, submitted 
manuscript) into pGEX-3X (Pharmacia). The He 358-> Arg, Lys 362->Ala, and Leu 539->Arg 
mutations were introduced intake GST-LBD construct using the QuikChange Kit (Stratagene) 
according to the manufacturer's instructions. The Val 376->Arg and Glu 542->Lys mutations were 

25 created in the GST-LBD construct by subcloning the Bsml/Hindlll fragments of derivatives of 
pSG5-ER-HEGO (Tora, et aL, supra) into which these mutations had already been introduced. All 
constructs were verified by automated sequencing (University of Chicago Cancer Research Center 
DNA Sequencing Facility). 

F. Radiolabeled full-length receptors and coactivator proteins 

30 Wild-type (WT) or mutant pCMV-hTRpl vector and the pSG5-GRIPl and pCNDC-SRC-la 

vectors were used to produce radiolabeled full-length receptors and coactivator proteins using the 
TNT coupled Reticulocyte Ly sate System (Promega) and [ 35 S]-Met (DuPont). GST-GRIP 1 (amino 
acids 721-1221), GST-GRIP1 (amino acids 563-1121), GST-SRC-la (amino acids 381-882), GST- 
hTRfil (ftill-length, WT or mutants, WT provided by. C. Costa), and the GST-hRXRa (full-length 
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5 provided by. C. Costa), fusion proteins were produced in E. coli strain HB101 as per the 
manufacturer's protocol (Pharmacia Biotech). 

G - Coactivato r GRIP1 563-767 His6 GST fusion protein 

GRIP1 563-767 was cloned as a Bam Hl-Xho I fragment derived from pGEX-2TK GRIP1 
563-1121 into the corresponding sites of P GEX-4T1. A His6-tag was added by inserting a Xho I- 

10 Nae I fragment of pET23a into Xho I-Bsa AI sites of this pGEX-4Tl construct yielding pGEX 
GRIP1 563-767His6. Mutants of GRIP1 563-767 were generated by PCR or single stranded 
mutagenesis using oligonucleotides carrying the mutations and a pSG5 GRIP1 vector as template. 
The mutations were confirmed by sequence analysis and integrated into pGEX GR1P1 563-767His6 
as NgoMI - Xho I fragments. The GRIP1 563-767 His6 GST fusion protein was expressed in 

15 HB 1 01 at 37°C. Protein expression was induced with 1 mM IPTG at an optical density (600 nm) of 
0.7 and extended for 4 hours after induction. Cells were harvested by centrifugation, resuspended 
in sonication buffer (20mM TrisHCl pH 8.0, 0.1M NaCl, 1 0%glycerol, 0.1 mM PMSF and protease 
inhibitors (Complete, EDTA free, Boehringer Mannheim)). The resuspended cells were freeze- 
thawed once, incubated on ice with O.lmg/ml lysozyme for 20 minutes and lysed per sonication. 

20 The lysate was cleared by ultracentrifugation (Ti 45, 36000rpm, lh 4°C), the supernatant filtered 
(Costar 0.2 um top filter) and loaded on a Talon column (Clontech). The column was washed with 
10 column volumes of sonication buffer supplemented with 12mM imidazole and eluted with an 
imidazole gradient [12 - lOOmM]. At this step the fusion proteins are about 95% pure. Imidazole 
was removed by gelfiltration on NAP columns (Pharmacia), and protein concentrations determined 

25 using the Biorad protein assay. Equal concentrations of the different derivatives of the fusion 
fragment were incubated with glutathione agarose (lh, 4°C) which was equilibrated in binding 
buffer (sonication buffer supplemented with ImM DTT, ImM EDTA and 0.01% NP-40). Beads 
were washed with at least 20 volumes of this buffer, diluted in binding buffer with 20% glycerol to 
40%, frozen in aliqots and stored at -70°C. 

30 H. Coactivator GRIP1 563-767 His6 

GRIP1 563-767 was cloned as a Bam HI - Xho I fragment derived fron pGEX GRIP1 563- 
767His6 into corresponding cloning sites of pET23a yielding pETGRIPl 563-767His6. The 
fragment was expressed in BL21DE3. Expression, cell lysis and Talon purification was identical as 
described for GST GRIP1 563-767His6. The protein eluted from a Talon column in two fractions, 
35 one at 1 2mM and one between 40 and 70mM imidazole. In the earlier eluting fraction the fragment 
was associated with a 70 kDa protein which was removed by a MonoQ run in 50mM TrisHCl 
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5 pH7.5, 10% glycerol, ImM EDTA, ImM DTT, O.lmM PMSF and protease inhibitors. GRIP1 563- 
767His6 eluted in the flow through and was concentrated by ultrafiltration. At this step the protein 
was more than 95% pure. 

Example 2: Peptide synthesis 

Coactivator peptides were obtained using standard techniques. All peptides were HPLC 
10 purified and analyzed by mass spectroscopy. Peptide concentrations were either determined 
spectroscopically using the tyrosine signal (A276 = 1450 M-lcm-1) or by amino acid analysis 
following standard techniques. 

Example 3: Binding assays with nuclear receptors and coactivators 

A. GST-GRIP Pull-down Assays and Peptide Competition Assays 

15 Binding experiments were performed by mixing glutathione beads containing 10 jug of GST 

fusion proteins (Coomassie Plus Protein Assay Reagent, Pierce) with 1-2 ^1 of the [ 35 S]-labeled 
wild-type or mutant hTRpi (25 fmoles, 4000 cpm of receptor), or coactivators in 150 jil of binding 
buffer (20 mM HEPES, 150 mM KC1, 25 mM MgCl 2 , 10% glycerol, 1 mM dithiothreitol, 0.2 mM 
phenylmethylsulfonyl fluoride, and protease inhibitors) containing 2 mg/ml BSA for 1.5 hrs in the 

20 presence or absence of 1 jaM T3. Beads were washed 3 times with 1 ml of binding buffer and the 
bound proteins were separated using 1 0% SDS-PAGE and visualized by autoradiography. Binding 
was quantitated by phosphorimaging using ImageQuant (Molecular Dynamics). 

For in vitro binding studies GR, TR and their derivatives were translated in the presence of 
[ 35 S]methionine using the TNT Coupled Reticulocyte System (Promega). Separate translations were 

25 performed in the presence and absence of IO^iM dexamethasone or l^M RU486 for GR and 10fiM 
triiodothyronine for TR. Expression was quantified by phosphoimager analysis (BAS2000, Fuji). 
For all binding assays 50}il of a 20% bead suspension containing either 1.6 or 4.0 |iM bound 
purified GST GRIP1 fragment (either 568-767 or 563-1121) was incubated with 0.2nl or lApl in 
vitro transcribed and translated TR or GR, respectively. Binding was performed in the binding 

30 buffer described above supplemented with 20 ng/ml BSA and appropriate hormone. The chosen 
GST GRIP1 fragment concentrations were sufficient to bind either 70 or 100% of the TR 
derivatives. The reaction was incubated at 4°C under rotation for 2 hours. In case of competition 
experiments, the appropriate concentration of peptides were added to the reaction before addition of 
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5 receptors. However, no differences in the results were noted by adding the peptides after half of the 
incubation of the GST GRIP1 fragment with nuclear receptors. This demonstrates that equilibrium 
is reached under the chosen conditions. Beads were washed five times with 200ul binding buffer + 
BSA at 4°C before elution of the bound proteins in 20ul SDS loading buffer. Eluted beads and 
input labeled protein were subjected to SDS-PAGE. The fraction of bound nuclear receptors was 
1 0 determined by phosphoimager analysis. 

B. GST-hTRfil Pull-down Assays 

Assay and analysis was performed as for Example 3 A. In vitro binding of [ 35 S]-labeled full- 
length GRIP1, [ 35 S]-labeled full-length SRC- la, and [ 3S S]-labeled full-length hRXRa, to GST- 
hTRpi wild-type (WT) and mutants was performed. Mutants V284R, K288A, I302R, L454R, and 
15 E457K all bound to hRXRa with an affinity equivalent to wild type hTR. All of these mutants 
showed decreased ability to bind GRIP1 and SRC- la, as expected from the results of Example 3 A. 
The same results were obtained when a GST-SRC 1 construct including SRC- la amino acids 381- 
882 was tested for binding of [ 33 S]-Met-labeled full-length hTRpl WT and mutants (data not 
shown). 

20 C. GST-hERcc LBD Pull-down Assays 

The wild-type and mutant GST-hERa LBDs were expressed in BL21(DE3) cells. Total 
ligand binding activity was determined by a controlled pore glass bead assay (Greene, et al., Mol. 
Endocrinol. (1988) 2:714-726) and protein levels were monitored by western blotting with a 
monoclonal antibody to hERa (H222). Cleared extracts containing the GST- hERcc LBDs were 

25 incubated in buffer alone (50 mM Tris, pH 7.4, 150 mM NaCl, 2 mM EDTA, 1 mM DTT, 0.5% 
NP-40 and a protease inhibitor cocktail) or with 1 uM of either DES or OHT for 1 hour at 4°C. 
Extract samples containing thirty pmol of GST-LBD were then incubated with 10 ul glutathione- 
Sepharose-4B beads (Pharmacia) for 1 hour at 4°C. Beads were washed five times with 20 mM 
HEPES, pH 7.4, 400 mM NaCl, and 0.05% NP-40. 35 S-labeled GRIP1 was synthesized by in vitro 

30 transcription and translation using the TNT Coupled Reticulocyte Lysate System (Promega) 
according to the manufacturer's instructions and pSG5-GRIPl as the template. Immobilized GST- 
hERa LBDs were incubated for 2.5 hours with 2.5 ul aliquots of crude translation reaction mixture 
diluted in 300 ul of Tris-buffered saline (TBS). After five washes in TBS containing 0.05% NP-40, 
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proteins were eluted by boiling the beads for 10 minutes in sample buffer. Bound 35 S-GRIP1 was 
quantitated by fluorography following SDS-PAGE. 



D. Electrophoretic Mobility Shift Assays 

GRIP1, a mouse pi 60 coactivator, recognizes the ERcc LBD in a ligand-dependent manner. 
The binding of agonists to the ERccLBD promotes recruitment of GRIP1, whereas binding of 
10 antagonists prevents this interaction (Norris, et al. r J. Biol Chem. (1998) 273:6679-88). While 
agonist-bound receptor will bind to all three of the NR boxes from GRIP1, ERct strongly prefers 
NR-box 2 (Ding, et ah, Mol Endocrinol (1998) 72:302-13). 

An electrophoretic mobility shift assay was used to directly assess the ability of the NR-box 
2 peptide to bind the purified ERa LBD in the presence of either DES or OHT. Eight microgram 
15 samples of purified hERa-LBD bound to either DES or OHT were incubated in the absence of the 
peptide, i.e., buffer alone, or in the presence of either a 2-fold or 10-fold molar excess of the GRIP1 
NR-box 2 peptide. The binding reactions were performed on ice for 45 minutes in 10 ^1 of buffer 
containing 20mM Tris, pH 8.1, ImM DTT, and 200mM NaCl and then subjected to 6% native 
PAGE. Gels were stained with GELCODE Blue Stain reagent (Pierce). 

20 In the presence of the NR-box 2 peptide, the migration of the DES-hERa-LBD complex was 

retarded. In contrast, peptide addition had no effect on the mobility of the OHT-hERct-LBD 
complex. Hence, this peptide fragment of GRIP1 possesses the ligand-dependent receptor binding 
activity characteristic of the full-length protein. 

Example: 4 Transfection assays with TR and hERa 

25 HeLa cell transfection and assay conditions are described (Webb et al., Mol Endocrinol 

(1995) 9:443). For TR assays, 5 ng of the reporter p(DR-4) 2 -TK-LUC consisting of two copies of 
the DR-4 element (a direct repeat of the consensus TR response element (TRE) spaced by 4 base 
pairs) placed upstream of a minimal (-327+45) thymidine kinase gene promoter linked to luciferase 
(LUC) coding sequences were used. A reporter containing palindromic TREs gave the same results 

30 (data not shown). Also, 2 \xg of the hTRfil expression vector, pCMX-TR (WT or mutant), and 0.5 
^g transfection control vector, pJ3LacZ, which contains the SV40 promoter linked to the (i- 
galactosidase gene, were used. Other cells co-transfected with vector or receptor constructs can be 
used for same purpose. Alternative cells expressing sufficient levels of an endogenous receptor(s), 
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or cells selected to express a single reporter, car, be used for transfection assays, including MCF-7 
cells expressing ER (Webb e. al., ^ , and G C cel!s expressing TR (Norman e, al., J Biol 
Chem. ( 1 989) 264: 1 2063- 1 2073). 

For hERa assays, 5 ug of estrogen responsive reporter plasmid encoding chloramphenicol 
acetyltransferase (CAT), pERE-collTATA (Sadovsky. et al., Mol Cell Biol. (1995) 75:1554) 0 5 ug 
expression vector encoding full-length hERa . pSG5-er HEGO (WT or mutants), and 2 ug of 
PjSlacz, were used. For the experiments of Figures 2 and 4, 0.5 ug of a full-length GRIP1 
expression vector, P SG5-GRIP1, was also included in the transfection. Transfected cells were 
treated w I th or without 1 uM T 3 or E 2 , as indicated. After culturing for 24 hrs, the LUC or CAT 
acnvmes were assayed and the P -galactosidase activities were also assayed to correct for 
Terences in transfection efficiencies. The triplicate points were averaged and standard deviations 
were less than 10%. 



20 



30 



Example 5: Hormone binding assays for wild-type and mutant TRs 

The T 3 binding affinity constants (Kd) for /„ vitro -translated WT and mutant TRs were 
measured using [-1] 3 ,5,3« triiodothyronine (^3) in gel filtration binding assays as 
descnbed (Apriletti et al., Protein Exp, Purif. (1995) 6:263). Both the Kd and standard error (S E ) 
values were calculated using the Prism computer program (GraphPad Software, Inc.). Mutations 
are ind.cated by the single-letter amino acid abbreviations, with the native residue name, followed 
by the primary sequence position number, and then the mutated residue name. The affinity of the 
WT TR is 81 + 12 p M . The relative affinity was determined by dividing the WT Kd by each 
mutant Kd. The 37 mutants tested with their relative affinities are: E217R (123%), E227R (109%) 
K242E (92o/ 0 ), E267R (117%)> H2?1R { ^ ^ ^ ^ ^ • 

D285A (89o/ 0 ), ^ ^ ^ ^ ^ ^ ^ ^ . 

E299A (17IHX I302A (M % X I302 R ( 99V o) , K306A (fl * X K306E (6%), P384R (,64»/ 0 ), A387R 
07/), E 390R (l 51 °/„), E393R (14 6o/ o) , L40 0R (95'Zo), H413R (109%), H416R (153%), M423R 
(156/o), R429A (48%), S437R (170%). L440R (174%), V444R (89%), T448R (234%) E449R 
(36%), P453E (32%), L454R (26%), L456R (46%), E457K (71%). 

Example 6: Coactivator binding assays for wild-type and mutant TRs 

Wild type (WT) TR and most of the TR mutants liganded to 3,5,3 '-triiodo-L-thyronine (T 3 ) 
bmd ecually well to the coactivator. GRIPl. I„ M case, GRIP1 binding was hormone-dependent 
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(data not shown). Mutations L454R and E457K in surface residues of helix 12 abolish GRIP1 
binding (Figure 1). Mutations in two residues of helix 3, V284R and K288A, and two residues of 
helix 5, 1302R and K306A, also impair binding (Figure 1). Five mutations with diminished GRIP1 
binding (V284R, K288A, I302R, L454R, and E457K) also show decreased binding to another 
coactivator, SRC-la (data not shown). Thus, these results show that two different coactivators 
recognize the same TR surface residues. 

Example 7: TR residues involved in ligand-dependent transcription activation in context of a 
ceU 

Residues involved in ligand-mediated transcription activation were identified by testing the 
TR mutants of Example 8 in HeLa cells. T 3 increased reporter gene activity 5-fold in cells 
expressing either WT TR or mutated TRs showing normal GRIP1 binding (representative mutants 
are shown in Figure 1. By contrast, TR mutants with diminished or absent GRIP1 binding (V284R, 
K288A, I302R, K306A, L454R, and E457K) show a diminished or absent response to T 3 which 
correlates with the GRIP1 binding defect. Overexpression of GRIP1 increases activation by the 
WT TR and rescues activation by TR mutants roughly in proportion to the severity of the defect of 
GRIP1 binding and activation (Figure 2). These results suggest that the same residues are required 
for coactivator binding, function of the endogenous coactivators) in HeLa cells, and responsiveness 
ofTRstoGRIPl. 

Example 8: Effect of TR mutations on other receptor functions 

The effects of the mutations on other receptor functions also were examined . All of the 
mutants bound radiolabeled thyroid hormone (Kd values, 6%-234% that for native receptor); 
occasional lower values were expected because some residues have partially buried side chains. 
None of the residues that decrease GRIP1 binding affected TR binding to a GST-RXR fusion 
protein or to DNA using three different DNA half-site arrangements and testing with or without 
added RXR (data not shown). Some mutations that affect GRIP1 binding occur in a region 
0 spanning helices 3-5, which has been suggested as important for TR/RXR heterodimerization 
(O'Donnell et al., supra - Lee et al., Mol. Endocrinol. (1992) 6:1867-1873). In contrast, however, the 
above results indicate that these residues do not contribute to TR/RXR heterodimerization. Further, 
TRs mutated in the CBS residues retain the ability of WT TR of T 3 -Klependent inhibition of the 
activity of the Jun and Fos transcription factors at an AP-1 site (Saatcioglu et al., suEm), suggesting 
5 that the CBS residues do not participate in TR actions mediated through these proteins. These data 
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' indict tha, ,he mutational effects are specific, the amount of i„p„, labeled TR in the different 
reacnons ,s comparable, and the levCs of expression of the mutant TRs are comparable to those of 
WT receptors. 

Example 9: Coactivator binding site in ER 

Three separate mutations (K362A, V376R, and E542K) were created in human estrogen 
receptor-cc (hERa) which align to three of the effective positions in hTR.pi (K288A, I302R and 
E457K). All three mutations diminish GRIP1 binding and abolish transcriptional activation 
(F.gure 3), and mutant V376R, with l 0 o/„ residual GRIP1 binding, was rescued partially by 
overexpression of GRIP1 (Figure 4, As a control, the ER mutants demonstrated a normal 
hormone-dependent ability to activate a vitellogenin-LUC hybrid reporter gene, GL45, which 
responds to the ER amino-terrninal activation function (Berry et al., EMBO J (1990) 92811-2818) 
(data not shown). The finding that similar residues are required for GRIP1 binding and 
transcr.pt.on activation activity in the TR and ER suggests that the coactivator binding site residues 
are similar m different nuclear receptors. 

Example 10: Coactivator NR-box binding affinity for TR 

J° StU<ly ^ intCTaCti0n b6tWeen reCept ° rS and GRIP1 in a ^« of GRIP1 

563-767) was purified that contains all three NR-boxes (Figures 6 and 7). The fragment was 

found to be highly soluble and, in agreement with a secondary structure prediction using PhD 
delays a mainly alpha-helical far UV-CD spectrum (data not shown). Three of the four helices' 
predicted for the fragment include the NR-boxes at their C-terminus, suggesting that these boxes are 
part of amphipathic alpha-helices. These results show that the NR-boxes of GRIP 1 are contained in 
a soluble, alpha-helical 24kD fragment. 

Binding assays show that GRIP1 NR-boxes 1, 2 and 3, interact differentially with hTRB 
LBD (Figure 7). A GST-fusion of the GRIP1 (563-767) fragment strongly binds TR (kD or EC50 ) 
»n a hgand depend fashion. Replacement of the hydrophobic residues of NR-box 3 with alanine 
does not reduce binding of TR significantly, whereas similar replacement of NR-box 2 results in 
loss of TR binding of about 50%. By titrating the amount of GRIP1 fragment, about a 4-fold 
stronger binding of TR for NR-box 2 (EC50 = ,. 0 uM) over NR-box 3 (EC50 = 4.0 uM) was 
estimated. In the absence of functional NR-boxes 2 and 3, almost no binding to TR was detected 
suggesting that under these experimental conditions NR-box 1 is not a cognate binding site for TR 
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5 Full length TR or TR-LBD bound GRIP1 equally. These results show that TR recognizes GRIP1 
NR-box 2 and 3, with preference for NR-box 2. 

Example 1 1 : Coactivator NR-box binding affinity for GR 

GR also was found to bind GRIP1 (563-767) in a ligand-dependent manner (Figure 8). 
However, in contrast to TR, extension of GRIP1 (563-767) to residue 1 121 increases binding to GR 
10 about 3-fold suggesting an additional binding site on GRIP1 for GR. Binding of the larger fragment 
remains ligand-dependent; no interaction can be observed in the presence of the GR partial 
antagonist RU486. These results are in agreement with in vivo 2-hybrid GR GRIP1 interaction 
studies. In the presence of ligand no difference was detected in the binding of GRIP 1 by full length 
GR or a deletion mutant of GR that lacks the N-terminal activation domain AF-1 . However in the 

15 absence of ligand, binding of GR to GRIP1 (563-1 121) increased by about 10-fold indicating that 
sequences in the GR N-terminus are able to suppress binding of unliganded GR to this additional 
binding site in GRIP1. Additionally, GR did not bind to a GRIP1 (563-767) mutant in which both 
NR-box 2 and 3 are replaced by alanines, and binds most strongly to a fragment that lacks a 
functional NR-box 2. As with TR, GR does not recognize NR-box 1 . In contrast to TR, the GR 

20 prefers NR-box 3 to NR-box 2. These results demonstrate that GR prefers binding to NR-box 3 and 
interacts with an additional GRIP1 site within the CREB (cAMP - response - element binding 
protein) - binding protein (CBP) binding site. 

Example 12: Coactivator peptide binding affinity for TR 

To investigate whether the preference of TR for NR-box 2 is dependent on the sequence or 
25 structural context of the NR-boxes, competition studies on the interaction of GRIP1 with hTRB 
LBD were performed using coactivator peptides containing different NR- boxes (NR-box 2 peptide 
(residues 11-23 of SEQ ID NO: 6) EKHKILHRLLQDS, and NR-box 3 peptide (residues 9-21 of 
SEQ ID NO: 7) ENALLRYLLDKDD) (Figure 9). Consistent with the interaction of hTR LBDB 
with GRIP1 (563-767) NR-box mutants, a peptide containing NR-box 1 competes the interaction of 
30 GRIP1 with hTRB LBD only at very high concentrations (EC50 = 130 uM). Peptides containing 
either NR-box 2 or 3 compete GRIPl (563-767) efficiently and display the preference of hTRB 
LBD for NR-box 2 (EC50 (NR-box 2) - 1.5 uM, EC50 (NR-box 3) = 4 uM). The apparent 
affinities (EC50) for peptides of NR-box 2 and 3 are comparable with the analogous GRIPl (563- 
767) NR-box mutants suggesting that the preference of TR for NR-boxes is solely dependent on the 
35 sequence and independent of the structural context of the NR-boxes. 
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Peptides of NR-box 2 or 3 compete GRIP1 (563-767) containing functional NR-boxes 2 and 
or a mutant of this fragment that contains only a functional NR-box 2 with comparable affinity 
Thus whtle TR can bind both NR-box 2 and 3, in a GRIP1 coactivator peptide fragment containing 
both boxes, TR preferentially binds NR-box 2. 

These results show the preference of TR for NR-box 2 is sequence dependent. 

The same types of assays for TR competition are performed to assess coactivator peptide 
bmdmg affinity for GR. The peptide concentrations are normalized relative to TR for obtaining 
comparable dose response curves. 



Example 13: Binding affinity of TR for extended coactivator peptides 

Sequence identity between all three central NR-boxes of the p!60 coactivator family is 
hm,ted to the conserved leucine residues of the (SEQ ID NO: 1) LxxLL motif (Figure 6) 
However, the sequence conservation of a particular NR-box can extend into neighboring residues 
To mvesttgate the contribution of these neighboring residues to affinity and specificity of the 
dxfferent NR-boxes for TR, the ability of peptides containing individual NR-boxes with different 
lengths of adjacent sequences to compete with the interaction of GRIP1 (563-767) with hTRB LBD 
20 were compared (Figure 10). 

A peptide consisting of the minimal motif of NR-box 3 (residues 12-17 of SEQ ID NO- 7- 
LLRYLL) does not compete the TR LBD interaction with GRIP1 (563-767). A peptide consisting 
of the NR-box 2 (residues 15-20 of SEQ ID NO: 6; ILHRLL) also does not sufficiently compete the 
tnteracuon (data not shown). Extending peptides containing a (SEQ ID NO: 1) LxxLL motif to 
mclude adjacent residues increased affinity for both NR-box motifs and magnified the preference of 
TR for NR-box 2 (NR-box 2 peptides: (residues 1 1-23 SEQ ID NO: 6) EKHKILHRLLQDS and 
restdues 7-23 of SEQ ID NO: 6) TSLKEKHKILHRLLQDS ; and NR-box 3 peptides: (residues 8- 
24 of SEQ ID NO: 7) KENALLRYLLDKDDTKD and (residues 5-24 of SEQ ID NO- 7) 
PKKKENALLRYLLDKDDTKD). A chimeric peptide containing the NR-box 3 motif inthe 
context of the NR-box 2 flanking sequences (SEQ ID NO: 31; TSLKEKHKLLRYLLQDSS) binds 
like a NR-box 2 peptide. 



These results demonstrates that preference of TR for NR-box 2 is at least partially due to 
features of the bound peptide (residues .5-20 of SEQ ID NO: 6; ILHRLL), but that their affinity 
and specificity is modulated by adjacent sequences. 
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5 Example 14: Binding affinity of TR and GR for mutant coactivator 
A. TR affinity for ILxxL T ■ mntif residues 

To investigate the role of the hydrophobic residues in NR-box 2, individual residues of the 
(residues 15-20 of SEQ ID NO: 6) ILHRLL motif were replaced by alanine in the background of 
GRIP1 (563-767) containing a non-functional NR-box 3 (Figure 11). Surprisingly, replacement of 
10 any of the conserved leucines prevents binding to TR almost completely. Only replacement of the 
nonconserved isoleucine exhibited a lessened but still severe impact on the affinity of NR-box 2 for 
TR. As replacement of a single leucine by alanine is sufficient to overcome the interaction of both 
the remaining hydrophobic residues and adjacent sequences with hTRB LBD, it appears that their 
contribution to the affinity of NR-box 2 for hTRB LBD is cooperative rather than additive. 

15 Similar results were obtained by competing the interaction of hTRB LBD with the GRIP1 

(563-767) NR-box 3 mutant using peptides in which either IL, HR or LL of the NR-box 2 motif are 
replaced by alanines (Figure 11). Whereas the peptides containing the IL or LL replacement failed 
to interact with the hTRB LBD even at very high concentrations, in agreement with a proposed 
alpha-helical structure of the motif, replacement of the "HR spacer" by alanines showed a marginal 

20 effect on the affinity of the peptide for TR-LBD. 

Replacement of single leucine residues of NR-box 2 by phenylalanine reduced the affinity 
of NR-box 2 peptides for TR LBD about 100-fold, replacement of the isoleucine about 10-fold 
(Figure 11). Therefore, the interaction of TR with GRIP1 relies not simply on the hydrophobicity 
of the (SEQ ID NO: 1) LxxLL motif, but also on positive contributions by the leucine residues 
25 themselves. 

These results demonstrate that single mutations of the conserved leucines in the (SEQ ID 
NO: 1) LxxLL motif strongly reduce affinity of GRIP1 for hTRB LBD. 

Collectively, the above examples demonstrate that peptides containing NR-boxes, 
particularly NR-box 2, reproduce the affinity and specificity of the interaction of GRIP 1 (563-767) 
30 with hTRB LBD. 

B. TR affinity of FxxLW and FxxAL mntif residues 

The three conserved leucines of the NR-box 2 (SEQ ID NO: 2) ILxxLL motif are embedded 
in the hydrophobic cleft of the hTRB LBD:NR-box 2 interaction surface, whereas the non conserved 
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5 isoleucine is located on the rim of this cleft where structural changes can be more easily 
accommodated (See Example 18). In agreement with this structure, replacement of this residue by 
alanine or phenylalanine reduced binding to hTRB LBD to a less extent than the comparable 
mutations of the conserved leucine residues. The surface generated by the three conserved leucines 
(L690, L693, L694) of the NR-box 2 peptide (residues 12-24 of SEQ ID NO: 6) 686- 

10 KHKILHRLLQDSS-698 is highly complementary to the corresponding binding site in the hTRB 
LBD (Figures 16 and 17). Comparison of this binding site to other nuclear receptors shows that it 
contains a structural motif that is unique, highly conserved and present in all known structures of 
nuclear receptor LBDs (Wurtz et al., Nat Struct Biol (1996) 5:87-94; Wagner et al, supra: Renaud 
et al., Nature (1995) 575:681-689; Bourguet et al., Nature (1995) 575:377-382; and Brzozowski et 

15 al., Nature (1997) 559:753-758). 

Interaction of highly conserved hydrophobic motifs, which are part of amphipathic alpha- 
helices, with complementary hydrophobic surfaces resembles a feature observed for the interaction 
of several other transcriptional activators with their target proteins (p53:MDM2, VP16:TAFII31 or 
CREB:KIX-CBP). However, the motifs of p53 (FxxLW), VP 16 (FxxAL) and CREB (YxxIL) 

20 differ from the (SEQ ID NO: 1) LxxLL motif of nuclear receptor coactivators. A Fxxxh motif may 
be generally involved in interaction with TAFII31, where "h" represents any hydrophobic residue. 
Though with respect to the known structures, complementarity of the interacting hydrophobic 
surfaces identified here seem to be a common feature of these interactions, cross-reactions between 
different motifs are possible. For instance, VP 16, p53, and p65 (FxxFL) are able to functionally 

25 interact with TAFII31, or p53 and E2F1-DP1 (FxxLL) both interact with MDM2. These 
interactions are sensitive to mutations in the Fxxxh motif. Therefore it appears that either 
complementarity of the hydrophobic surfaces is not an absolute requirement or that induced fitting 
of the interacting surfaces is possible. 



30 



35 



Based on these observations, studies were performed to determine whether GRIP1 interacts 
with TAFII31 or MDM2. However, no interaction was detected. GRIP1 mutants changing NR-box 
2 ( SEQ ID NO: 1 ; LxxLL) to VP16 ( SEQ ID NO: 4; FxxAL) or p53 (SEQ ID NO: 3; FxxLW) like 
binding sites also failed to bind TAFII31 or MDM2 demonstrating that the presence of the correct 
binding site is not sufficient to create binding (data not shown). Moreover, peptides containing the 
VP 16 or p53 binding sites are not able to compete the interaction of GRIP 1 with TR, even in very 
high concentration, but do compete the interaction with GR (data not shown). The affinity of this 
interaction is weak, but comparable to affinity of a peptide of NR-box 2 that, in the context of a 
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Jo 

5 GRIP1 mutant lacking NR-box 3, binds GR in vivo (Ding et al., supra ). This binding is only about 
ten times less than a peptide containing NR-box 3, GR's primary binding site. 

As shown above, GR binds GRIP1 (563-767) with about one-fifth the affinity than a 
comparable amount of TR. Thus, the high concentration of NR-box 3 peptide required to compete 
the interaction of GR with GRIP1 (563-767) may rather reflect a weak affinity of GR for the 
10 peptide rather than a particular strong interaction of GR with GRIP1 (563-767). 

These results suggest that at least on the peptide level, other hydrophobic motifs besides 
(SEQ ID NO: 1) LxxLL can interact with the coactivator binding site, but that it is receptor 
dependent. 

C. TR affinity for residues adjacent to ILxxLL motif 

15 Peptides containing a FxxLL motif bind TR but with two orders of magnitude lower affinity 

than a (SEQ ID NO: 1) LxxLL motif (Figure 11). To test whether the additional changes in the 
hydrophobic motif or adjacent sequences of the VP 16 peptide prevent its binding to TR, a chimeric 
peptide containing the NR box-2 motif (SEQ ID NO: 1) LxxLL in the context of the VP 16 sequence 
was constructed. This peptide binds to TR but with an about 100-fold lower affinity than the 

20 original NR-box 2 peptide. Thus, the inability to bind the VP 16 peptide appears to be due to the 
combination of an imperfect hydrophobic motif and the incompatibility of TR to adjacent sequences 
of the VP 16 motif. 

As the interaction of the chimeric peptide with GR was comparable to the original NR-box 2 
and VP 16 peptides, this incompatibility appears due to TR-specific features in the NR-box 
25 interaction surface. These results show sequences adjacent the NR-box motif LxxLL can reduce 
binding of NR-box 2 to TR, but not GR. 

Example 15: Crystallization and Structure Determination of NR LBD Complexes 

A. Crystallization of hTRB LBD with Ti and GRIP1 NR-box 2 Peptide 

Several peptides containing GRIP1 NR-box 2 were tested in crystallization trials with the 
30 hTRB LBD. The complex of the hTRB LBD with the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) produced crystals that were dependent 
on both the presence and the concentration of the peptide. Without the peptide, the hTRB LBD 
precipitated immediately. However, nucleation was erratic, but could be overcome through seeding 
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of prepared drops with microcrystais of the hTRB LBD.GRIPl NR-box 2 peptide complex. 
Structure of the hTRB LBD:GRIP1 NR-box 2 peptide complex was determined by molecular 
replacement using the structure of the hTRB LBD determined previously (Wagner et al., supra) , and 
refined to a resolution of 3.6A (Table 1). The refined model consists of residues K21 1-P254 and 
V264-D461 of monomer 1 of the hTRB LBD, residues K21 1-P254 and G261-D461 of monomer 2 
of the hTRB LBD, and the GRIP1 NR-box 2 peptides (residues 14-24 of SEQ ID NO: 6) 688- 
KILHRLLQDSS-698, and (residues 14-22 of SEQ ID NO: 6) 688-KILHRLLQD-696 (Appendix 

Briefly, the complex between the hTRB LBD and the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) was prepared by mixing (equal) 
volumes of a solution of 9mg/ml hTRB LBD in 20mM HEPES pH 7.4 with a solution of 14 mM 
GRIP1 in 0.4mM ammonium acetate pH 4.72, and incubating the mixture on ice for 1 hour. 
Crystals were obtained after 2 days at 4°C using hanging drop vapor diffusion from a drop 
containing 1.5ul of hTRB LBD:GRIP1 complex, prepared as described, and 0.5ul 15%PEG 4K, 
0.2M sodium citrate pH 4.9, suspended above a reservoir containing 10% PEG 4K, 0.1M 
ammonium acetate, and 0.05 M sodium citrate (pH 5.6). After allowing the drop to equilibrate for 1 
hour, 0.2ul of 10-3 to 10-5 dilutions of microcrystais in reservoir buffer were introduced to provide 
nucleation. Crystals are of space group P3121 (a=95.2, b=95.2, c=137.6) and contain two 
molecules of the hTRB LBD and two molecules of the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6). 
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Table 1 

Data collection, phasing, and refinement statistics 



Data collection 


Data set 


Resolution 
(A) 


Reflections 


Coverage (%) 




Native 


3.6 


measured 


unique 


96.3 


0.007 


35565 


8490 




Rotation search 


Search model 


Euler angles (°) 


Correlation coefficient 


e, 


e 2 




Highest peak 


Highest false peak 


hTR p LBD 


Ml 


60.12 


80.68 


241.90 


16.3 




M2 


9.93 


87.70 


180.6 


15.9 


14.2 




Translation search 




Fractional coordinates 


Translation function 


X 


y 


z 


Highest peak (o) 


Highest false peak (o) 




Ml 


0.522 


0.428 


0.250 


19.52 


10.02 


M2 


0.200 


0.932 


0.119 


26.11 


5.77 




Refinement 




Resolution 
(A) 


Reflection 


R 




F>2( 


25 - 3.7 


7614 


0.2990 


0.3219 


All data 


25 - 3.7 


7851 


0.3010 


0.317 





R«ym = Zh ^ | I Ki u (/ h ( | / Z k for the intensity (I) of / observations of reflection h. 

Correlation coefficient = X h Eo 2 Ec 2 - Eo 2 Ec 2 l [Z h (Eo 2 - Eo 2 ) 2 Z h {Ec 2 -Ec 2 f] m 

Translation function (t a , t b , . . .) = Z h (I£o (h) | 2 - Z h <\Eo (h) \ 2 >) (Ec (Ma,t b ,...)| 2 - <\Ec (h) \ 2 ) 
where E 0 represents the normalized observed structure factor amplitudes, and 
E c represents the normalized structure factors for the search model in a 
triclinic unit cell with dimensions identical to that of the crystal. The 
reported peak height represents the value of the function for the translation 
(t a , tb) of the NCS monomers, divided by the rms value of the translation 
function density. 

R factor = Z | F obs - | / Z | F obs |. 

Rfrce is calculated the same as R factor, except only for 1 0% of the reflections that 
were set aside for cross validation and not used in refinement 



BNSDOCID; <WO 996001 AA2J_> 



10 



15 



20 



WO 99/60014 PCT/US99/06899 

39 

5 B. Crystallization of hERg LBD with PES and GRIP1 NR-box 2 Peptide 

Crystals of a DES-hERa LBD-GRIP1 NR-box 2 peptide complex were obtained by hanging 
drop vapor diffusion. Prior to crystallization, the DES-hERa LBD (residues 297-554) complex was 
incubated with a 2-4 fold molar excess of the GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS- 
698 (residues 12-24 of SEQ ID NO: 6) for 7-16 hr. Two p.L samples of this solution were mixed 

10 with equal volume samples of reservoir buffer consisting of 25-27% (w/v) PEG 4000, 90 mM Tris 
(pH 8.75-9,0) and 180 mM Na Acetate and suspended over wells containing 800 of the reservoir 
buffer. After 4-7 days at 19-21°C, rod-like crystals were obtained. The coactivator complex 
crystals lie in the spacegroup P2| with cell dimensions a=54.09, b=82.22, c=58.04 and P=111.34. 
Two molecules each of the DES-LBD and the coactivator peptide form the asymmetric unit A 200 

15 jam x 40 \im x 40 \xm crystal was transferred to a cryosolvent solution containing 25% (w/v) PEG 
4000, 10% (w/v) ethylene glycol, 100 mM Tris (pH 8.5), 200 mM Na Acetate and 10 nM peptide 
and frozen in an N2 stream at -170°C in a rayon loop. Diffraction data from this crystal were 
measured at -170°C using a 300 mm MAR image plate at the Stanford Synchrotron Radiation 
Laboratory (SSRL) at beamiine 7-1 at a wavelength of 1.08 A. The diffraction images were 

20 processed with DENZO and scaled with SCALEPACK (Otwinowski, et al., Methods EnzymoL 
(1997) 276:307-326) using the default -3a cutoff. 

C, Crystallization of hERa LBD with OHT 

Crystals of the hERa LBD (residues 297-554) complexed to OHT were obtained by the 
hanging drop vapor diffusion method. Equal volume aliquots (2 pL) of a solution containing 3.9 

25 mg/mL protein-ligand complex and the reservoir solution containing 9% (w/v) PEG 8000, 6% (w/v) 
ethylene glycol, 50 mM HEPES (pH 6.7) and 200 mM NaCl were mixed and suspended over 800 
jiL of the reservoir solution. Hexagonal plate-like crystals formed after 4-7 days at 21-23°C. Both 
crystal size and quality were improved through microseeding techniques. These crystals belong to 
the space group P6 5 22 with cell parameters a=b=58.24 A and o=277.47 A. The asymmetric unit 

30 consists of a single hERa LBD monomer; the dimer axis lies along a crystallographic two-fold. A 
single crystal (400 jim x 250 x 40 ^m) was briefly incubated in a cryoprotectant solution 
consisting of 10% (w/v) PEG 8000, 25% (w/v) ethylene glycol, 50 mM HEPES (pH 7.0) and 200 
mM NaCl and then flash frozen in liquid N2 suspended in a rayon loop. Diffraction data were 
measured at -170°C using a 345 mm MAR image plate at SSRL at beamiine 9-1 and at a 
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wavelength of 0.98 A. The diffraction images were processed with DENZO and scaled with 
SCALEPACK (Otwinowski, et al., supra ) using the default -3a cutoff. 

Example 16: Structure determination and refinement of NR LBD complexes 

A. Structure o f hTRB LBD with T 2 and GRIP1 NR-box 2 Peptide 

Data were measured using Cu Ka radiation from an R-axis generator at 50 kV and 300 mA 
with a 0.3mM collimator and a Ni filter. Reflections were measured using an R-Axis II detector 
and integrated with Denzo, and equivalent reflections scaled using Scalepack (Otwinowski and 
Minor, "Processing of x-ray diffraction data collected in oscillation mode." In Macromolecular 
Crystallography, Part A (ed. C.W. Carter, Jr. and R.M. Sweet), pp. 307-326. Academic Press, New 
York, NY). Possible rotation function solutions were calculated using normalized amplitudes in 
15 AMORE from a model of hTRfi LBD with the ligand, T 3 , omitted; translation function solutions 
were subsequently determined using TFFC for the two rotation solutions with the highest 
correlation coefficients. For two hTRB LBD molecules in the asymmetric unit, the calculated 
solvent content is 52%. After rigid body refinement of the two hTRB LBD molecules, electron 
density maps were calculated. Strong positive density present in both the anomalous and 
conventional difference Fourier maps for the iodine atoms of the T 3 ligand confirmed the 
correctness of the solution. The iodine atoms for both T 3 ligands were modeled as a rigid body, and 
the structure refined with strict NCS symmetry using CNS. Both 2FoFc and FoFc electron density 
maps showed interpretable density, related by the NCS operator, near HI 2 of both molecules of the 
hTRB LBD. The electron density could be modeled as a short a-helix, and the observed side chain 
25 density was used to tentatively assign the sequence and direction to the chain. The refined model 
consists of residues of the hTRB LBD, and peptide residues of the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDS S-698 (residues 12-24 of SEQ ID NO: 6). 

Atomic coordinates of the hTRB LBD:GRP1 site 2 peptide complex are attached as 
Appendix 1. 

30 B - Structure o f hERnt LBD with PES and GRIP1 NR-box 2 Peptide 

Initial efforts to determine the structure of the DES-hERa LBD-NR box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) complex utilized a low resolution (3.1 
A) data set (data not shown). A self-rotation search implemented with POLARRFN ("The CCP4 
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5 suite: programs for prottin crystallography-, Ada Crys.allogr. (1994) 050:760-763) indicated the 
presence of a noncrysullographic dyad. The two LBDs in the asymmetric were located by 
moleculat replacement in AMoRe (CCP4, 1994) using a partial po.yalanine mode, of the human 
RARr LBD (Renaud, e. al, sj^ as , he ^ probe (R=J8 . 2%> cc . 35 . 6% ^ ^ 
both monomers). Given tha, the model at this point was bod, inaccurate (,m.s.d. 1.7 A between 
Uus mode, and the final mode, based on Ca positions) and incomplete (accounting for on.y -45% 
of the to*, scattering matter in the asymmetric unit), an aggressive density modification prrrtoco. 
was undertaken. Iterative cydes of two-fold NCS averaging in DM (CCP4, 1994) interspaced with 
mode, bunding m M OLOC (Muller, e, al, Bull. Soc. Ou^ Bel g . (1988) 97:655-667) and model 
refinement in REFMAC (Murshudov, e. a,., Aca Crys!allogr . (1997) D5 , aM , 2s5) ^ ^ 
NCS results) were used to quickly bui,d a mode, of the LBD alone. For mis procedure, MAMA 
(KJeywegt, c, al, "Halloween...masks aM bone, In From Firs, Map to Final Model-, Bailey e, a. 
eds„ Warrington. England. SERC Daresbury Laboratory, ,994) was used for all mask' 
mansions and PHASES (Furey, e. a,., PA33 An,. Crys, Assoc. Mr g . Aos,r. (1990) ,8:73) and 

20 wiT SU " e (CCP4 ' '" 4) ^ "" d ° f ' Mm •* ** «* 

However, although the DES-hERa LBD-NR complex model accounts for -90% of the 
scanenng matter in me asymmetric unit, refinement was being hampered by severe model bias 

The high-resolution data set of the DES-hFRn, r Rn x,d u -> 

me nfcRa LBD-NR-box 2 peptide complex became available 

when the R free of the OHT-hERa LBD model was -31% TW, ™ • *u 

ueiwas J1/o - Both monomers m the asymmetric unit of 

-5 the DES complex crystal were relocated using AMoRe and the incompletely refined OHT-hERa 
LBD model (with helix ,2 ^ *c loop between helices 1 , and ,2 removed, as the search model 
The nussmg parts of me model were built and the rest of the model was corrected using MOLOC 
and two-fold averaged maps generated in DM. Initially, refinement was carried out with REFMAC 
ustng „gh, NCS restraints. At later stages, the mode, was refined without NCS restraints using the 
smtulated armealmg, minimization and B-fac,or refinement protocols in X-PLOR and a maximum- 
hkeuhood target All B-fac,ors were refined isotropically and anisotropic scaling and a bulk soiven. 
correoton were used. The R^ set contained a random samp.e of 6.5% of al, data. ,„ refinement 
all data between 27 and 2.03 A (with no o cutoff) were used. The final mode, was composed of 
restdues 305-549 of monomer A, residues 305-46, and 470-554 of monomer B, residues 687-697 of 
pepttde A, residues 686-696 of pept.de B, ,64 waters, two carboxymethy, groups and . chloride 
.on. Accordmg ,„ PROCHECK. 93.7% of al. residues in the mode, were in the core regions of the 
Ramachandran plot and none were in the disallowed regions. Thus, the structure of me DES-hERa 
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5" LBD-NR-box 2 peptide complex has been refined to a crystallographic R-factor of 19.9% 
(R&ee=25.0%) using data to 2.03 A resolution. 

He 689 from the peptide interacts with three receptor residues (Asp 538, Giu 542 and Leu 
539). The y-carboxylate of Glu 542 forms hydrogen bonds to the amides of residues 689 and 690 of 
the peptide. A water-mediated hydrogen bond network is formed between the imidazole ring of His 

10 377, the y-carboxylate of Glu 380, and the amide of Tyr 537. Three residues (Glu 380, Leu 536 and 
Tyr 537) interact with each other through van der Waals contacts and/or hydrogen bonds. 
Intriguingly, mutations in each these three residues dramatically increase the transcription activity 
of unliganded ERa LBD (Eng, et al., Mol. Cell. Biol. (1997) 17:4644-4653); Lazennec, et aL. Mol. 
Endocrinol. (1997) 1 1:1375-86; White, et al., EMBO J. (1997) 16:1427-35). Atomic coordinates of 

15 DES-LBD-peptide complex are attached as Appendix 2. 
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Table 2 

Summary of Crystallographic Statistics 

Lieand 



Data Collection 
Space group 
Resolution 
Observations 
Unique 

Completeness (%) 
Average I/oI 



PES 

P2, 

2.03 
104189 
30265 

98.4 

7.8 

9.8 



OHT 
P6 5 22 
1.90 
269253 
23064 
99.1 
7.0 
16.1 



Refinement 

Number of non-hydrogen atoms 
R«ys« (%) b /Rfi« (%) 
Bond r.m.s. deviation (A) 
Angle r.m.s. deviation (°) 
Average B factor (A 2 ) 



25 



4180 2070 

19.9/25.0 23.0/26.1 

0.006 0.006 

105 1.05 

34.0 40.4 
Rsym I Ii- <Ii> | /Zili where <Ij> is the average intensity 
over symmetry equivalents 
Rcnrs. = Z|F 0 -F c |/5:|F 0 | 



30 



35 



C * Structure of hERoc I.Rrui ffr comp ly 

The OHT complex data set was then collected. Starting with one of the monomers of the 
prehmmary low-resolution DES-hERa LBD-NR-box 2 peptide model as the search probe 
mo ecu ar replacement in AMoRe was used to search for the iocation of LBD in this crystal form in 

CC P ^ P6522 " A tianSlati0n " P6s22 yidded COrreCt <R-*.8H 

CC-38.2/0). I„ order to reduce model bias, DMMULTI (CCP4, 1994) was then used to project' 

averaged density from the DES complex cell into the OHT complex cell. Using MOLOC, a model 

of the hERa LBD was built into the resulting density. The model was refined initially in REFMAC 

and later wth the simulated annealing, positional and B-factor refinement protocols in X-PLOR 
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(Brunger, X-PLOR Version 3.843, New Haven, Connecticut: Yale University, 1996) using a 
maximum-likelihood target (Adams, et al., Proc. Natl. Acad. Sci. USA (1997) 94:5018-23). 
Anisotropic scaling and a bulk solvent correction were used and all B-factors were refined 
isotropically. Except for the R free set (a random sampling consisting of 8% of the data set), all data 
between 41 and 1.9 A (with no a cutoff) were included. The final model consisted of residues 306- 
551, the ligand and 78 waters. According to PROCHECK (CCP4, 1994), 91.6% of all residues in 
the model were in the core regions of the Ramachandran plot and none were in the disallowed 
regions. Thus, the structure of the OHT-hERa LBD complex has been refined against data of 
comparable resolution (1.90 A) to a crystallographic R-factor of 23.0% ^=26.2%). Atomic 
coordinates of OHT-hERa LBD complex are attached as Appendix 3. 

Example 17: Structural analysis of hTRB LBD:GRIP 1 NR-box 2 peptide complex 
A. Structure of cocrvstal complex (conten ts of asu) 

The asymetric unit (asu) of the crystal contains two monomers of the hTRB LBD and two 
molecules of the GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ 
ID NO: 6), which observes the NCS relation of the two TR monomers (Figure 12). The structure 
of the hTRB LBD, which closely resembles that of the rTRa LBD (Wagner et al., smra), consists 
of twelve alpha-helices and two B-strands organized in three layers, resembling an alpha-helical 
sandwich. The only significant difference between the hTRB LBD and the rTRa LBD is disorder in 
the loop between helices HI and H3. The GRIP1 NR-box 2 peptide forms an amphipathic a-helix 
of about 3 turns, preceded by 2 residues and followed by 3 residues in extended coil conformation. 

The relation of the two monomers of the hTRB LBD is primarily translational, and does not 
resemble the homodimer structures reported for the hRXR, or the hER (Bourguet et al., m>ra; 
Brzozowski et al., supra) . Furthermore, the interface between the two monomers does not involve 
residues necessary for formation of the physiological TR dimer. Instead, one of the cocrystal 
peptides appears to bridge the interaction between the two monomers. The hydrophobic face of the 
alpha-helix of the cocrystal peptide contacts monomer 1 of the hTRB LBD at H3, H5, and H12, 
while the hydrophilic face contacts monomer 2 at the hairpin turn preceding strand S3. The second 
cocrystal peptide also contacts monomer 2 at H3, H5, and H12, and the two cocrystal peptides 
observe the same NCS relation as TR LBD monomers. 
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The common interface between both cocrystal peptides and the hTRB LBD buries the 
hydrophobe residues that define the cocrystal peptide (SEQ ID NO: 1) LxxLL sequence motif 
revues Ile689, Leu690, Leu693, and Leu694; against the surface of the receptor LBD (Figures 16 
and 17). The presence of the second peptide in the crystal, duplicating the interactions of the 
hydrophobe residues, suggests those interactions are specific and drive the interaction of the 
peptide with the hTRB LBD, while the hydrophilic interactions provide a fortuitous crystal contact 
and account for the dependence of crystallization on the presence and concentration of the peptide. 

B * Structure of the CRTP1 N R_ b ox 2 p eprirf* 

,2 M NR ~ b ° X 2 P6Ptide ^ " ^ CiyStaIIi2ation is » -*» -ids long (residues 

2-24 of SEQ ID NO: 6; 686-KHKILHRLLQDSS-698). For the NR-box 2 peptide in monomer 1 
peptide 1), 12 amino acids are ordered in the crystal. Residues K688 - Q694 form an amphipathic 
hehx, with residues K686-H687 and D695-S698 on either end in extended coil conformations. For 
*e NR-box 2 peptide in monomer 2 (peptide 2), residues K688 - Q694 again form an amphipathic 
he,*, but me cnds of me peptide „ ^ ^ ^ ^ ^ 

absolute assignment of hydrogen bonds, it is evident from the periodicity of the side chain density 

ZTZT fonn " aIpha " heUX In abSenCC ° fTR * e far UV-CD spectrum of the 

GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO- 6) appears 
tc .be .random coil (data not shown). Stable helix formation may thus be induced by the interaction 
of the hydrophobic amino acids with the receptor LBD as it has been proposed in other 
prote in :protein interactions, such as P 53:MDM2 (Kussie et al., Science (1996) 27^948-953) 
VP16:TAF31 (Uesugi et al., Science (1996) 277:1310-1313), and CREB:KIX-CBP (Radhakrishnan' 
etal., Cell (1997) 97:741-752). 



30 



C * §*""*ure of the HTRB I, BD:GRTP1 NR-hov 7 p ^H. 

The hTRB LBD of the cocrystal contributes residues from three helices, H3, H5. and H12 to 
the mterface, which pack against one another to create a hydrophobic cleft. The residues lining the 
cleft are ,280, T281, V283, V284, A287, and K288 from H3; Q301, 1302, L305, and K306 from 
H5; and L454, E457, V458, and F459 from H12. A cysteine residue (C309) from H6 appears to 
provuie a partial surface that is buried deep within the bottom of the cleft. 

The GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO- 
6) bmds at the junction of H3 and H12. Leu690 of the bound peptide inserts into a shallow but 
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defined depression at the base of the cleft, making van der Waals contact with L454 and V458 of 
H12, while peptide residue Ile689 packs against L454 of HI 2 outside the edge of the cleft; L454, 
then, interdigitates between the two residues. One further turn C-terminal along the alpha-helix, 
L693 and L694 of the bound peptide pack into complementary pockets within the hydrophobic 
cleft. Peptide residue L693 forms van der Waals contact with V284 of H3, while peptide residue 
L694, bound more deeply in the cleft, makes contact with F298 and L305 of H4 and H5. The 
hydrophobic interactions of the GRIP1 NR-box 2 peptide with the hTRfi LBD are observed for both 
cocrystal peptides 1 and 2 in their respective monomers of the crystal dimer complex, suggesting 
that the interactions are specific to the peptide, and not induced by crystallization. , 



Example 18: Overall Structure of the DES-hERa-LBD-NR-box 2 Peptide Complex 

15 The asymmetric unit of the DES-hERcc LBD-NR-box 2 peptide 686-KHKILHRLLQDSS- 

698 (residues 12-24 of SEQ ID NO: 6) complex crystals contains the same noncrystallographic 
dimer of LBDs that has been observed in the previously determined structures of the LBD bound to 
both E 2 and RAL (Brzozowski, et al., supra and Tanenbaum, et al., supra). Beyond the flexible 
loops between helices 2 and 3 and helices 9 and 10, the two LBDs of the dimer adopt similar 

20 structures (r.m.s.d. 0.47 A based on Ca positions). The conformation of each LBD complexed with 
DES closely resembles that of the LBD bound to E 2 (Brzozowski, et al., s«2CS); each monomer is a 
wedge shaped molecule consisting of three layers of eleven to twelve helices and a single beta 
hairpin. In each LBD, the hydrophobic face of helix 12 is packed against helices 3, 5/6 and 11 
covering the ligand binding pocket. One NR-box 2 peptide is bound to each LBD in a hydrophobic 

25 cleft composed of residues from helices 3, 4, 5 and 12 and the turn between 3 and 4. The density 
for both peptides in the asymmetric unit is continuous and unambiguous. Residues 687 to 697 from 
peptide A and residues 686 to 696 from peptide B have been modeled; the remaining residues are 
disordered. Given that each peptide lies within a different environment within the crystal, it is 
striking that from residues He 689 to Gin 695 each peptide forms a two turn, amphipathic a helix. 
30 Flanking this region of common secondary structure, the peptides adopt dissimilar random coil 
conformations. 

Example 19: Structure of the OHT-hERa LBD Complex 

The binding of OHT induces a conformation of the hERa LBD that differs in both 
secondary and tertiary structural organization from that driven by DES binding. In the DES 
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— chain from residues 339 ,o 34!, 42, ,o 423. and 527 * 530 fonn nam of helices 
3, and > , respectively. In contrast these regions adop, an exttnded conformation in the OHT 
complex. In addition, the composition and orientation of helix ,2 a™ different in the two sutures 
Hehx ,n the DBS complex consists of residues 538 to 546 whereas helix 12 in the OHT complex 
conststs of residues 536 to 544. Most dramatical rather than covering the ligand binding pd, 
as .. does ,„ the DBS complex, helix ,2 in the OHT convex occupies the part of the JLtivator 
buxhng groove formed by residues ftom helices 3, 4, and 5, and the turn connecting helices 3 and 4 
Thts alternative conformation of helix .2 appears to be similar to that observed in the RAL comptex 
(Brzozowski, et al., suoraY P 



Example 20: Coactivator binding site structure and function 
15 A " TR coat- Hva tor binding 

Theaboveexamp.esdemonsrra^thatnuclearrecepto., exemplified by TR, GR and ER, are 

hTdirt- , r *" bind ° ,te ° ush * * «*- a 

hydrophob.c cleft and a charged hydrophobic perimeter. Identification and chanson ot Ms 
20 destgn and se,ec,io» of compounds mat modulate binding of coactivator ,o nuclear recepl. 

Residues forming the coactivator binding site were found to clusKr within a surprisingly 
-all area wrth weU-defined borders (see, e.g„ Flg „ r es 5, 14, and 15). As is shown in above 
Examples, mutated residues nearby this area do no, affec, coactivator binding or transcriptional 
activation. Additional.* me coactivator binding assays and struck, arises demonstiated ma, 
•5 NR-boxcontauung proband peptides bind to mis site. Tnese resu!ts also showed thatthe 
GRIP, coac«va,or protein binds to the site through a highly (SEQ ID NO: 1) LxxIX. 

(SEO ir^tTlTr 1 ShOWed *" conrac,in8 3 teucin = ™** °f 

(SEQ ID NO: I) LxxU. motif included V284, F293, 1302, L305 and L454. Residues within 4 5A 

» L454 Z r«57 'si""" inC ' Uded T2S1 - V284> K288 ' ^ Q3 °'- 13 ° 2 - "° 5 - "<* 

L454andE457. Suuctum, analyses also revealed two other features of the site: a hydrophobic 

bTItix to! 2 (Phe459) conWbu,es ,o ,ocal ^ - d ' »— 

by he. x (Cys309> that provides a partial surface buried deep within the site. Mutationa. ana.yses 

: Z ' ^ V45S - MUKKd reSidues lik ^ » «"-*> » conformational 

change upon hormone binding inc.uded Leu454 and G.u457. Thus, the site identify by 
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5 mutational, binding assays and crystallography corresponds to a surprisingly small cluster of 
residues on the surface of the LBD that define a prominent hydrophobic cleft formed by 
hydrophobic residues corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283, 
Val284, and Ala287), helix 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 
(Leu454, Val458 and Phe459). Collectively, the Examples indicate that residues forming the site 

1 0 are amino acids corresponding to human TR residues of C-terminal helix 3 (Ile280, Thr28 1 , 

Val283, Val284, Ala287, and Lys288), helix 4 (Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306), 
helix 6 (Cys309), and helix 12 (Pro453, Leu454, Glu457, Val458 and Phe459). The coactivator 
binding site is highly conserved among the nuclear receptor super family (Figure 19). 

The coactivator binding site of TR contains charged and hydrophobic residues at its 
15 periphery, but only hydrophobic residues at its center (see, e.g.. Figures 5 and 18). The 
hydrophobic cleft at the center of the site may play a significant role in driving the coactivator 
binding reaction. The site is comprised of two parts (Figure 18), right). Residues contained in 
helices 3, 5 and 6 (Figure 18, yellow residues) likely form a constitutive part, since their positions 
are identical in all nuclear receptor structures reported, including the liganded, activated states of 
20 the TR, RAR, and ER, the unliganded RXR, and the inhibitor-liganded ER. By contrast, the 
residues of helix 12 (Figure 18, red residues) are differently positioned in the active and inactive 
states reported. Thus the coactivator binding site for the nuclear receptors is likely to be formed in 
response to an active hormone by positioning helix 12 against a scaffold formed by helices 3-6. 
Because the coactivator binding site is so small, it is easy to understand how even slight changes in 
25 the position of helix 12, which may, for example, be induced by an antagonist ligand, could impair 
coactivator binding, and thus receptor activation. 



B. ER coactivator binding site 

Binding of the NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID 
NO: 6) to the ERa LBD buries 1000 A 2 of predominantly hydrophobic surface area from both 

30 molecules. The NR-box 2 peptide binding site is a shallow groove composed of residues Leu 354, 
Val 355, He 358, Ala 361 and Lys 362 from helix 3; Phe 367 and Val 368 from helix 4; Leu 372 
from the turn between helices 3 and 4; Gin 375, Val 376, Leu 379 and Glu 380 from helix 5; and 
Asp 538, Leu 539, Glu 542 and Met 543 from helix 12. The floor and sides of this groove are 
completely nonpolar, but the ends of this groove are charged. Therefore, structural characterization 

35 of the binding site of the NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 1 2-24 of SEQ 
ID NO: 6) to the ERa LBD, which is the same NR-box 2 peptide utilized to crystallize the T 3 -TR 
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LBD, supports the findings for TR that residues forming the coactivator binding site of nuclear 
receptors ,s composed of a well defined hydrophobic cleft and a charged hydrophobic perimeter 
These residues are highly conserved among the nuclear receptor super family (Figure 19) 
Structural characterization of the coactivator peptide-bound ER LBD also supports the concept of 
exploiting the slight differences among the coactivator binding sites of nuclear receptors in 
designing and identifying compounds that target specific nuclear receptors. 

The ERa LBD interacts primarily with the hydrophobic face of the NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) a helix formed by the side chains of 
He 689 and the three (SEQ ID NO: 1) LxxLL motif leucines (Leu 690, Leu 693 and Leu 694) The 
side cham of Leu 690 is deeply embedded within the groove and forms van der Waals contacts with 
the side chains of lie 358, Va, 376, Leu 379, GIu 380 and Met 543. The side chain of Leu 694 is 
similarly isolated within the groove and makes van der Waals contacts with the side chains of He 
358, Lys 362, Leu 372, Gin 375, Val 376 and Leu 379. In contrast, the side chains of both He 689 
and the second NR box leucine, Leu 693, rest against the rim of the groove. The side chain of He 
689 lies ,n a shallow depression formed by the side chains of Asp 538, Leu 539 and Glu 542 The 
s.de cham of Leu 693 makes nonpolar contacts with the side chains of He 358 and Leu 539. 

The charged and polar side chains which form the hydrophilic face of the peptide helix 
project away from the ERa receptor and either interact predominantly with solvent or form 
symmetry contacts. None of the side chains of the polar and charged residues outside the helical 
region of either peptide in the asymmetric unit, with the exception of Lys 688 of peptide B is 
involved in hydrogen bonds or salt bridges with its associated ERa LBD monomer. The s-amino 
group of Lys 688 of peptide B hydrogen bonds to the side chain carboxylate of GIu 380 of 
monomer B. This interaction is presumably a crystal artifact; the main chain atoms of the N- 
terminal three residues of peptide B are displaced from monomer B and interact extensively with a 
symmetry-related ERa LBD. 

In addition to interacting with the hydrophobic face of the peptide helix, the ERa LBD 
stabilizes the main chain conformation of the NR box peptide by forming capping interactions with 
both ends of the peptide helix. Glu 542 and Lys 362 are positioned at opposite ends of the peptide 
binding sue. The side chains of Glu 542 and Lys 362 form van der Waals contacts with main chain 
and S1 de cham atoms at the N- and C-terminal turns of the peptide helix respectively. These 
interactions position the stabilizing charges of the y-carboxylate of Glu 542 and s-amino group of 
Lys 362 near the ends of the NR box peptide helix. The side chain carboxylate of GIu 542 
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5 hydrogen bonds to the amides of the residues of N-terminal turn of the peptide helix (residues 688 
and 689 of peptide A; residues 689 and 690 of peptide B). Similarly, the E-amino group of Lys 362 
hydrogen bonds to the carbonyls of the residues of the C-terminal turn of the peptide helix (residue 
693 of peptide A; residues 693 and 694 of peptide B). 

Except for the orientation of helix 12, the structure of the peptide binding groove of the ERcc 
1 0 LBD is almost identical in the DES and OHT complexes. The region of this groove outside of helix 
12 is referred to herein as the "static region" of the NR box binding site. Helix 12 in the OHT 
complex and the NR box peptide helix in the DES complex interact with the static region of the 
coactivator recognition groove in strikingly similar ways. 

Helix 12 mimics the hydrophobic interactions of the NR box peptide with the static region 
15 of the groove with a stretch of residues (residues 540 to 544) that resembles an NR box ((residues 6- 
10 of SEQ ID NO: 43) LLEML instead of (SEQ ID NO: 1) LxxLL). The side chains of Leu 540 
and Met 543 lie in approximately the same locations as those of the first and second motif leucines 
(Leu 690 and Leu 693) in the peptide complex. Leu 540 is inserted into the groove and makes van 
der Waals contacts with Leu 354, Val 376 and Glu 380. Met 543 lies along the edge of the groove 
20 and forms van der Waals contacts with the side chains of Leu 354, Val 355 and He 358. The side 
chain position of Leu 544 almost exactly overlaps that of the third NR box leucine, Leu 694. Deep 
within the groove, the Leu 544 side chain makes van der Waals contacts with the side chains of lie 
358, Lys 362, Leu 372, Gin 375, Val 376 and Leu 379. 

Helix 12 in the OHT complex is also stabilized by N- and C-terminal capping interactions. 

25 Lys 362 interacts with the C-terminal turn of helix 12 much as it does with the equivalent turn of 
the peptide helix. The Lys 362 side chain packs against the C-terminal turn of the helix 12 with its 
e-amino group hydrogen bonding to the carbonyls of residues 543 and 544. Given that the capping 
interaction at the N-terminal turn coactivator helix is formed by a helix 12 residue (Glu 542), the N- 
terminal turn of helix 12 in the antagonist complex is forced to interact with another residue, Glu 

30 380. The Glu 380 y-carboxylate forms van der Waals contacts with Tyr 537 and interacts with the 
amide of Tyr 537 through a series of water-mediated hydrogen bonds. 

In addition to forming these "NR box-like" interactions, helix 12 also forms van der Waals 
contacts with areas of the ERa LBD outside of the coactivator recognition groove. The side chain 
of Leu 536 forms van der Waals contacts with Glu 380 and Trp 383 and that of Tyr 537 forms van 
35 der Waals contacts with His 373, Val 376 and Glu 380. As a result of these contacts, helix 12 in the 
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OHT complex buries more solvent accessible surface area (-1200 A 2 ) than the NR box peptide in 
the DES-ERcc LBD-peptide complex. 



Identification and characterization of the coactivator binding site for TR, and extension of 
this information to other nuclear receptors shows that this site is common for all nuclear receptors 
identified to date. Additionally, sequence and structural comparison, coupled with the Examples 
showing differential specificity for coactivator binding to TR, GR and ER, reveal that minor 
differences between the receptors, such as found in helix 12, are likely to influence specificity of a 
coactivator for different types of nuclear receptors. Thus, the Examples presented herein 
demonstrate that information derived from the structure and function of the TR coactivator binding 
site can be applied in design and selection of compounds that modulate binding of coactivator 
1 5 proteins to nuclear receptors for all members of the nuclear receptor super family. 
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Appendix 1 

Atomic Coordinates for Human TR-B Complexed With T 3 , and a GRIP1 NR-box 2 Peptide 
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two molecules of TRB - CHAIN A and CHAIN B 

two molecules of T3 - CHAIN J and CHAIN K 

two molecules of GRIP-1 peptide - CHAIN X and CHAIN Y 

chain X lies between A and'B 

chain Y interacts with B only 

residues differing between A and B include: 

A 217 Glu, A 252 Gin, A 263 Lys (missing side chains) 

B 237 Ser, B239 His, B 394 Lys (missing side chains) 

additionally Gly 261, Gly 262 are not visible in chain 
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176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 
188 
189 
190 
191 
192 
193 
194 
195 
196 



CE1 HIS 
NE2 HIS 



C 
O 
N 

CA 
CB 



HIS 
HIS 
VAL 
VAL 
VAL 



CGI VAL 
CG2 VAL 



C 
O 
N 

CA 
CB 
C 
O 
N 

CA 
CB 



VAL 
VAL 
ALA 
ALA 
ALA 
ALA 
ALA 
THR 
THR 
THR 



OG1 THR 
CG2 THR 
C 
O 
N 

CA 
CB 

CG ASN 
OD1 ASN 
ND2 ASN 



229 

229 

229 

229 

230 

230 

230 

230 

230 

230 

230 

231 

231 

231 

231 

231 

232 

232 

232 

232 

232 



C 
O 
N 

CA 

CB 

C 

O 

N 

CA 
CB 
CG 
CD 



C 
O 
N 

CA 
C 
O 
N 

CA 
CB 
OG 
C 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

THR A 232 
THR A 232 
ASN A 233 
ASN A 233 
ASN A 233 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



ASN 
ASN 
ALA 
ALA 
ALA 
ALA 
ALA 
GLN 
GLN 
GLN 
GLN 
GLN 



OE1 GLN 
NE2 GLN 



GLN 
GLN 
GLY 
GLY 
GLY 
GLY 
SER 
SER 
SER 
SER 
SER 



233 
233 
233 
233 
233 
234 
234 
234 
234 
234 
235 
235 
235 
235 
235 
235 
235 
235 
235 
236 
236 
236 
236 
237 
237 
237 
237 
237 



58 

27.654 

28.635 

26.225 

25.591 

26.519 

26.088 

26.890 

26.557 

28.381 

24.603 

23.959 

24.072 

22.669 

22.482 

21.792 

20.565 

22.436 

21.722 

22.112 

23.467 
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21.990 
22.055 
21.679- 
22.783 
23.134 
24.626 
25.141 
24.822 
25.951 
22.241 
22.312 
21.381 
20.423 
19.748 
20. 988 
22.041 
20.227 
20,562 
20.328 
18.887 
17.896 
17.668 
17.313 
21.960 
22.386 
22.676 
24.053 
24.923 
24.917 
25.739 
26.566 
27.981 
28.821 
25.938 



27 . 692 
26. 934 
24-541 
25.227 
23.256 
22.554 
21.256 
20 . 656 

21 .509 

22 .239 
22.316 
21.862 
21.500 
20.582 
22.734 
22. 647 
23.894 
25.161 
25.832 
26.283 
24 . 846 
26.114 
27.279 
25.625 
26. 468 
26.283 
27.355 
28.544 
26-951 
26.035 
24 - 900 
26. 954 
26.708 
28 .015 
26.062 
26. 419 
25.096 
24 .363 
25.239 
25.292 
25.420 
24 . 448 
26.596 
23.840 
23.458 
23.766 
23.245 
23.491 
24 .565 
22.526 
22.760 
22.206 
23.145 
22 .127 



32.280 
31 .840 
34 .312 
33.528 
34.113 
32.916 
32.701 
31.345 
32.817 
32.900 
31.847 
34.059 
34.175 
35.374 
34 .314 
34.324 
34.384 
34.528 
35.850 
35.791 
37.008 
33.387 
33.436 
32.381 
31.231 
30.880 
29.927 
30.096 
28. 959 
30.073 
29. 604 
29. 646 
28.564 
28.186 
27. 308 
26.822 
26.819 
25.629 
24.391 
23.908 
25.019 
25.768 
25.149 
25.573 
24.508 
26.687 
26.627 
25.390 
24 .844 
24 .991 
23.801 
24.015 
24 .689 
22.542 



1.00 26.95 6 
1.00 31.27 7 
1.00 38.40 6 
1.00 41.49 8 
1.00 38.55 7 
1.00 40.40 6 
1.00 44.68 6 
1.00 39.39 6 
1.00 42.18 6 
1.00 44.28 6 
1.00 45.94 8 
1.00 45.59 7 
1.00 47.84 6 
1.00 45.08 6 
1.00 48.04 6 
1.00 49.95 8 
1.00 47.26 7 
1.00 43.64 6 
1.00 41.93 6 
1.00 39.10 8 
1.00-29.80 6 
1.00 43.97 6 
1.00 40.55 8 
1.00 48.62 7 
1.00 58.62 6 
1.00 62.44 6 
1.00 68.35 6 
1.00 65.50 8 
1.00 74.29 7 
1.00 65.06 6 
1.00 69.47 8 
1.00 68.80 7 
1.00 70.98 6 
1.00 71.43 6 
1.00 73.83 6 
1.00 74.33 8 
1.00 75.07 7 
1.0076. 32 6 
1.00 76.98 6 
1.00 77.07 6 
1.00 80.85 6 
1.00 82.01 8 
1.00 78.80 7 
1.00 77.15 6 
1.00 76.06 8 
1.00 77.46 7 
1.00 78.37 6 
1.00 79.43 6 
1.00 79.47 8 
1.00 77.98 7 
1.00 76.49 6 
1.00 76.46 6 
1.00 40.00 8 
1.00 75.35 6 
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ATOM 

ATOM 

ATOM 
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197 
198 
199 
200 
201 
202 
203 
204 
205 
206 
207 
208 
209 
210 
211 
212 
213 
214 
215 
216 
217 
218 
219 
220 
221 
222 
223 
224 
225 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 



O 
N 

CA 
CB 
CG 



SER 
HIS 
HIS 
HIS 
HIS 



CD2 HIS 
ND1 HIS 
CE1 HIS 
NE2 HIS 



C 
O 
N 

CA 
CB 
CG 



HIS 
HIS 



TRP 
TRP 
CD2 TRP 
CE2 TRP 
CE3 TRP 
CD1 TRP 
NE1 TRP 
C22 TRP 
CZ3 TRP 
CH2 TRP 
C 
O 
N 

CA 
CB 
CG 
CD 
CE 
NZ 
C 
O 
N 

CA 
CB 
CG 



LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 



ASN 
ASN 
OD1 ASN 
ND2 ASN 



C 
O 
N 

CA 
CB 
CG 
CD 
CE 
NZ 
C 
O 
N 

CA 
CB 



ASN 



LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
ARG 
ARG 
ARG 



237 
238 
238 
238 
238 
238 
238 
238 
238 
238 
238 



239 
239 
239 
239 
239 
239 
239 
239 
239 
239 



240 
240 
240 
240 
240 
240 
240 
240 
240 



241 
241 
241 
241 
241 



242 
242 
242 
242 
242 
242 
242 
242 
243 
243 
243 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

TRP A 239 
TRP A 239 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

TRP A 239 
TRP A 239 
A 
A 
A 
A 
A 
A 
A 
A 
A 

ASN A 241 
ASN A 241 
A 
A 
A 
A 
A 

ASN A 241 
LYS A 242 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



59 

26. 605 
24.648 
23.842 
22.990 
22.408 
22.790 
21.223 
20.951 
21.874 
22.971 
21.863 
23.487 
22.872 
23.563 
25.022 
25.532 
26. 961 
24.936 
26.102 
27.268 
27.798 
25.763 
27.171 
22.799 
21.706 
23.946 
23. 978 
25.314 
26.529 
27.805 
28.980 
30.238 
22.808 
22.550 
22.113 
20.976 
21.122 
22.304 
22.404 
23.176 
19.570 
18.581 
19.475 
18.191 
18.164 
18.250 
17.004 
15.755 
15.927 
18.143 
17.102 
19.334 
19.617 
21.070 
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21.418 
22 .410 
21.981 
20.732 
19.933 
18.757 
20.303 
19.365 
18.444 
23.284 
23.137 
24.510 
25.894 
27.026 
27.366 
28.662 
28.522 
29.911 
26.548 
27.241 
29. 598 
30.967 
30.825 
26.407 
26.562 
26.701 
27.180 
26.780 
27.342 
27.037 
27.720 
27.438 
26.699 
27.298 
25.640 
25.078 
23.562 
23. 121 
23.506 
22.310 
25.421 
24.822 
26. 380 
26.786 
26.396 
24 .896 
24.149 
24.491 
24.161 
28 .291 
28.923 
28.813 
30.219 
30.274 



21 .797 
22.353 
21.236 
21.661 
20.542 
19.957 
19.875 
18.953 
18 .994 
20.964 
20.441 
21.368 
21.195 
22.005 
21.688 
21.240 
21.136 
20. 969 
21.781 
21.475 
20.764 
20.569 
20. 482 
19.774 
19.263 
19.157 
17.783 
17.153 
17.872 
17.108 
17.776 
17.034 
16.895 
15.851 
17.325 
16.599 
16.550 
15.693 
14.503 
16.271 
17.152 
16.731 
18.069 
18.642 
20.119 
20.337 
19.821 
20.643 
22.109 
18.483 
18.592 
18 .204 
17.975 
17.463 



1.00 75.47 
1-00 75.56 
1.00 75.46 
1.00 75.85 
1.00 40.00 
1.00 40.00 
1.00 40.00 
1.00 40.00 
1.00 40.00 
1.00 74.10 
1.00 75.34 
1.00 73.39 
1.00 74.02 
1.00 81.77 
1.00 89.67 
1.00 93.19 
1.00 95.46 
1.00 95.35 
1.00 94.16 
00 97.48 
00 96.23 
00 96.75 
00 97.32 



-00 70.77 
00 71.70 
00 67.10 
00 65.63 
00 66.65 



1.00 69.83 
1.00 71.49 
1.00 71.31 
1.00 72.23 
1.00 66.19 
1.00 65.20 
1.00 66.69 
1.00 67.53 
1.00 67.98 
1.00 70.19 
1.00 71.37 
1-00 71.48 
1.00 66.62 
1.00 64.76 
1.00, 66.86 
1.00 67.46 
1-00 67.93 
1-00 71.52 
1.00 74.32 
1.00 74.41 
1.00 74.44 
1.00 66.28 
1.00 67.61 
1.00 64.19 
1.00 62.43 
1.00 60.12 
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6 

6 

6 

6 

7 

6 

7 

6 

8 
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6 

6 

6 

6 

6 

6 

6 
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6 
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6 

6 

6 

6 

6 

7 
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255 
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263 
264 
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266 
267 
268 
269 
270 
271 
272 
273 
274 
275 
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278 
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281 
282 
283 
284 
285 
286 
287 
288 
289 
290 
291 
292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 



CG 
CD 
NE 
CZ 



ARG 
ARG 
ARG 
ARG 



NH1 ARG 
NH2 ARG 



ARG 
ARG 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
LYS 
PHE 



C 
O 
N 

CA 
CB 
CG 
CD 
CE 
NZ 
C 
O 
N 

CA 
CB 
CG 
CD1 
CD2 
CE1 

CE2 PHE 
CZ PHE 



243 

24 3 

243 

243 

243 

243 

243 

243 

244 

244 

244 

244 

244 

244 

244 

24 4 

244 

245 



PHE 
PHE 
PHE 
PHE 



C 
O 
N 

CD 

CA 

CB 

CG 

C 

O 

N 

CA 
CB 
CG 
CD 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

PHE A 245 
PHE A 245 
A 
A 
A 
A 
A 
A 
A 
A 

LEU A 246 
LEU A 246 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

PRO A 247 
GLU A 248 
A 
A 
A 
A 
A 



PHE 
PHE 



C 
O 
N 

CA 
CB 
CG 

CD1 LEU 
CD2 LEU 



LEU 
LEU 



LEU 
LEU 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 



GLU 
GLU 
GLU 
GLU 



245 
245 
245 
245 
245 
245 
245 
245 



246 

246 

246 

246 

246 

246 

247 

247 

247 

247 

247 

247 



OE1 GLU 
OE2 GLU 



C 
O 
N 

CA 



GLU 
GLU 
ASP 
ASP 



A 
A 
A 
A 
A 



248 
248 
248 
248 
248 
248 
248 
248 
249 
249 



21.665 
23.213 
23.826 
25.113 
25.905 
25.592 
18.639 
18.662 
17.771 
16.790 
15.368 
15.102 
13.641 
13.182 
13.951 
17.009 
16.562 
17.705 
18.045 
18 .825 
19.908 
19.618 
21.198 
20.614 
22.189 
21.897 
16.856 
15.946 
16.919 
15.884 
16.227 
15.100 
14 .010 
15.575 
15.717 
16.706 
14.473 
13.263 
14.198 
12.687 
12.161 
14.996 
15.159 
15.506 
16.280 
16.481 
17.012 
16.981 
16.432 
17.509 
15.624 
16.298 
14. 300 
13.493 



31 . 636 
31.599 
31.217 
31.439 
32.041 
31.097 
30.789 
30.390 
31.692 
32 .309 
31.974 
30.471 
30.167 
30.908 
30.536 
33.806 
34.514 
34.264 
35.692 
35.947 
34.979 
33.714 
35.309 
32.794 
34.385 
33.126 
36.620 
36.516 
37 .558 
38.554 
39.510 
40.384 
39.474 
41.325 
39.330 
39.609 
39.668 
39.314 
40.421 
40.671 
39.922 
41.733 
42.455 
42.006 
43.197 
43.273 
44.671 
44.939 
44 .144 
46.015 
44 .489 
45.395 
44.545 
45.703 



17.305 
17.267 
15.996 
15.714 
16.616 
14.520 
16.950 
15.784 
17.393 
16.498 
16.962 
17.104 
17.468 
18.737 
19.970 
16.501 
17.399 
15.468 
15.333 
14.049 
13.834 
13.399 
14.139 
13.255 
13.994 
13.552 
15.340 
14.528 
16.272 
16-437 
17.585 
18 .086 
18 .640 
19.151 
15.135 
14.430 
14.784 
15.534 
13.558 
13.600 
14.729 
13. 496 
14.486 
12.299 
11.976 
10.437 
9.966 
8.471 
7.644 
8-086 
12-458 
12.918 
12.323 
12.673 



1.00 40.00 6 
1.00 40.00 6 
1.00 40.00 7 
1.00 40.00 6 
1.00 40.00 7 
1.00 40.00 7 
1.00 62.97 6 
1.00 63.96 8 
1.00 62.41 7 
1.00 61.57 6 
1.00 63.68 6 
1.00 71.29 6 
1.00 73.83 6 
1.00 74.71 6 
1.00 73.32 7 
1.00' 59.30 6 
1.00 56.34 8 
1.00 57.06 7 
1.00 59.01 6 
1.00 59.62 6 
1.00 66.60 6 
1.00 67.17 6 
1.00 69.25 6 
1.00 69.92 6 
1.00 70.50 6 
1.00 70.89 6 
1.00 60.68 6 
1.00 62.37 8 
1.00 60.10 7 
1.00 59.44 6 
1.00 57.43 6 
1.00- 54.41 6 
1.00 52.43 6 
1.00 51.69 6 
1.00 62.05 6 
1.00 59.85 8 
1.00 63.33 7 
1.00 64.44 6 
1.00 63.56 6 
1.00 64.42 6 
1.00 64.90 6 
1.00 61.94 6 
1.00 61.60 8 
1.00 61.33 7 
1.00 63.50 6 
1.00 66.94 6 
1.00 68.70 6 
1.00 40.00 6 
1.00 40.00 8 
1.00 40.00 8 
1.00 64.19 6 
1.00 65.56 8 
1.00 64.36 7 
1.00 63.33 6 
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15.953 
16.619 
16.138 
17.454 
16.973 
17.634 
13. 650 
14.081 
12.756 
12.290 
11.651 
10.442 
12.601 
11.267 
10.854 
10.849 
9.871 
9.414 
10. 498 
9.789 
11.836 
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13.444 
12.568 
14.238 
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13.541 
14.014 
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14.681 
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14.323 
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10.657 
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17.060 
17.459 
16.686 
17.109 
16.931 
16.292 
17.263 
18.116 
16.145 
15.854 
14.598 
14.795 
13.352 
17.045 
17.478 
17.561 
18. 69-2 
19.136 
19.272 
18.179 
18.041 
19.018 
20.190 
18.802 
19.823 
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20.135 
21.171 
21.198 
22.208 
21.525 
20.942 
21.879 
19.671 
19.289 
17.822 
17.472 
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19.529 
20.073 
19.121 
19.277 
18.586 
17.083 
16.484 
16.431 
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34.215 
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36.808 
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32.386 
31.118 
30.413 
30.099 
31.281 
30.176 
29.546 
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29.374 
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27.921 
29.077 
29. 529 
28.886 
30.593 
29.582 
28.735 
30.882 
31.434 
32. 965 
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28.568 
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36.595 
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37.412 
38.207 
37.165 
37.851 
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35.898 
37.444 
37.713 
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36.218 
34 .780 
34.013 
32.977 
32.117 
31.352 
31.327 
30.595 
37.170 
37.855 
37.190 
38.057 
37.981 
38 .952 
36.578 
39.498 
40.229 
39.892 
41.239 
41.548 
42.945 
41.360 
41.353 
42.345 
40.327 
40.306 
39.072 
39.149 
40.110 
38.231 
40.305 
40.808 
39.737 
39.704 
38 .793 
38.987 
38.974 



1.00 33.74 8 
1.00 29.96 7 
1.00 25.94 6 
1.00 26.95 6 
1.00 15.40 6 
1.00 26.73 6 
1.00 34.31 6 
1.00 31.39 6 
1.00 35.69 8 
1.00 30.90 7 
1.00 33.49 6 
1.00 37.18 6 
1.00 46.48 8 
1.00 32.85 6 
1.00 29.94 6 
1.00 25.55 8 
1.00 32.70 7 
1.00 34.27 6 
1.00 33.78 6 
1.00 45.15 6 
1.00 58.24 6 
1.00 68.41 7 
1.00 72.31 6 
1.00 77.89 7 
1.00 69.25 7 
1.00 34.81 6 
1.00 36.03 8 
1.00 31.71 7 
1.00 30.16 6 
1.00 29.00 6 
1.00 28.64 6 
1.00 28.28 6 
1.00 32.50 6 
1.00 33.48 8 
1.00 30.96 7 
1.00 29.14 6 
1.00 31.27 6 
1.00 24.21 6 
1.00 30.51 6 
1.00 28.89 6 
1.00 27.29 8 
1.00 28.76 7 
1.00 35.32 6 
1.00 33.29 6 
1.00 38.15 6 
1.00 34.70 8 
1.00 34.43 8 
1.00 36.70 6 
1.00 37.96 8 
1.00 35.96 7 
1.00 37.10 6 
1.00 37.97 6 
1.00 36.50 6 
1.00 36.75 6 
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25.931 
27.331 
27.307 
28 . 008 
23.631 
24.317 
23.393 
23.917 
23.523 
23.346 
23.994 
22.114 
21.429 
19.994 
19.025 
17.628 
17.135 
17.196 
22.120 
21.967 
22.865 
23.571 
23. 655 
22.271 
22.331 
20. 941 
20.273 
24.948 
25. 642 
25.312 
26.594 
27.153 
27.358 
27.945 
28.267 
26.434 
25.803 
27.028 
27.851 
26.905 
27.755 
28.202 
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28 .522 
29.021 
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29.218 
26.860 
29.233 
28.052 
27.105 
26.192 
28.228 
28.448 
29.828 
27.393 
26.973 
26.979 
25.971 
25.746 
26.819 
26.246 
25.478 
26.327 
24.632 
23.857 
24.366 
23.120 
22.708 
22.492 
21. 606 
21.362 
20. 165 
23.185 
22.184 
24.370 
24.583 
25.972 
26.290 
27. 675 
25.242 
24.405 
25.235 
23.333 
22.330 
23.096 
21.860 
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24 .298 
24.739 
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26.475 
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27.886 
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28.091 



39.199 
39.161 
39.387 
39.371 
41.083 
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41.877 
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19.179 
19.265 
17.931 
17.613 
16.512 
18.436 
20.359 
20.265 



28.107 
29. 168 
30.493 
28.833 
31.478 
29.819 
31.146 
27 .772 



28.576 
26.572 
26.125 
24 .733 
24.663 
26.125 
25.954 
26.327 
26.384 
25.830 
24 .309 
23.747 
23.946 
23.086 
27.831 
28.184 
28.631 
30.034 
30.864 
30.658 
31.030 
31.454 
30.366 
29.660 
31.405 
32.241 
31.811 
32.990 
33.155 
32.212 
32.684 
32.035 
32.369 
31.877 
33.162 
33.863 
34 .288 
34 .654 
36.096 
36.833 
36.996 
35.904 
35.952 
34.976 
36.492 
37 .441 



47.715 
47.524 
47.747 
47.116 
47.564 
46.930 
47.155 
50.164 
51.034 
50.062 
50.962 
50.524 
48.829 
52.423 
53.322 
52.645 
53.996 
54 .015 
54.061 
55.342 
56.412 
55.335 
54.488 
55.426 
53.846 
54.159 
52.923 
52.315 
50.861 
53.086 
54.548 
54.148 
55.365 
55.930 
55.787 
56.738 
56.872 
54.558 
53.556 
54.655 
53.567 
53.915 
54.440 
53.291 
52.282 
54.202 
54.016 
55.360 
56.125 
57.095 
57.706 
57.292 
53-152 
52.379 



1.00 40.98 
1.00 42.78 
1.00 44.40 
1.00 43.66 
1.00 39.83 
1.00 46.21 
1.00 45.18 
1.00 45.54 
1.00 42.01 
1.00 47.05 
1.00 50.15 
1.00 45.90 
1.00 51.50 
1.00 51.38 
00 53.83 
00 49.72 



00 52.53 
00 57.40 
00 69.63 
00 78.49 
00 82.82 
1.00 85.30 
1.00 48.54 
1.00"49.82 



00 43.79 
.00 45.42 
.00 41.04 
.00 42.74 
.00 40.99 
.00 39.44 
.00 45.56 
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6 
6 
6 
6 
6 
6 
6 
8 
7 
6 
6 
16 
6 



1.00 43.07 
1.00 46.99 
1.00 47.12 
1.00 49.61 
1.00 49.91 
1.00 51.28 
1.00 49.59 
1.00 51.66 
1.00 51.02 
1.00 52.86 
1.00 54.57 
1.00 67.87 
1.00 48.51 
1.00 49.58 
1.00 44.17 
1.00 47.57 
1.00 49.92 
1.00 59.30 
1.00 63.80 
1.00 69.03 
1.00 67.10 
1.00 46.57 
1.00 44.65 
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292 
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CA ASP 
CB ASP 
CG ASP 
OD1 ASP 
OD2 ASP 



C 
O 
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CA 
CB 
CG 
CD 



ASP 
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GLN 
GLN 
GLN 
GLN 
GLN 



OE1 GLN 
NE2 GLN 
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CA 
CB 
CG2 
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A 
A 
A 
A 
A 
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A 
A 
A 
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A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

ILE A 303 
LEU A 304 
A 
A 
A 
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A 
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305 

305 
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67 

21.467 
22.661 
23.919 
24.223 
24.153 
24.572 
22.514 
22.775 
22.095 
21.896 
20.991 
21.644 
20.690 
19.658 
21.027 
21.242 
21.482 
20.413 
19.726 
18.502 
17.818 
17.502 
17.003 
20.698 
20.960 
21.228 
22.179 
23.023 
23.946 
22.141 
22.916 
23.093 
23.354 
23.580 
24.465 
24.935 
26.150 
26.267 
26.084 
23.764 
24.212 
22.657 
21.892 
20.565 
20.637 
19.247 
21.340 
21.665 
21.954 
21.157 
20.868 
20.293 
18.919 
18.397 
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35.765 
36.042 
35.513 
36.239 
37.488 
35.575 
35.390 
36.021 
34 .124 
33.390 
32.179 
31.003 
29.824 
29.957 
28.685 
34.305 
34.185 
35.228 
36. 179 
36.774 
37.788 
35.673 
34.897 
37.268 
37.453 
37.972 
39.060 
39.338 
40.522 
39.653 
39.806 
38.705 
39.509 
37.477 
36.964 
35.554 
35.480 
36.731 
34.226 
36.968 
37.623 
36.236 
36.147 
35.434 
33.950 
33.370 
33.280 
37.524 
37.747 
38.439 
39.800 
40.615 
40.163 
41.127 



53.294 
52.509 
53.213 
54.473 
54.493 
55.483 
51.138 
50.113 
51.137 
49. 902 
50.137 
50.808 
50.988 
51. 675 
50.394 
48.877 
47.686 
49.372 
48.511 
49.217 
48.342 
49.581 
48.385 
48.096 
46.906 
49.097 
48.874 
50. 109 
49.861 
51.313 
52.589 
47.722 
46.835 
47.762 
46.734 
47. 123 
48.029 
48.876 
48.861 
45.389 
44.443 
45.318 
44.089 
44.359 
44.635 
44.779 
43.466 
43.477 
42.301 
44 .298 
43.864 
45.026 
45.511 
46.559 



1-00 45.17 
1.00 43.32 
1.00 37.38 
1.00 36.23 
1.00 35.87 
1.00 40.14 
1-00 42.81 
1.00 46.02 
1.00 38.60 
1.00 40.00 
1.00 38.59 
1.00 40.26 
1.00" 44.15 
1.00 45.73 
1.00 46.13 
1.00 41.64 
1.00 45.02 
1.00 41.01 
1.00 40.23 
1.00 39.52 
1.00 31.98 
1.00 40.77 
1.00 45.43 
1.00 38.58 
1-00 40.81 
1.00 37.50 
1.00 39.33 
1.00 39.06 
1.00 36.19 
1.00* 40.15 
1.00 36.93 
1.00 36.49 
1.00 36.58 
1.00 32.91 
1-00 27.55 
l.OQ 22.35 
1.00 26.88 
1.00 24.82 
1.00 23.69 
1.00 28.05 
1.00 24.68 
1.00 26.34 
1.00 30.91 
1.00 32.50 
1.00 33.36 
1.00. 33.87 
1.00 31.72 
1.00 29.76 
1.00 29.33 
1.00 29.72 
1.00 34.28 
1.00 35.98 
1-00 43.35 
1.00 51.50 
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MET 
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307 
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309 
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309 
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310 
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310 
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311 
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313 
313 
313 
313 
313 
313 
313 
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A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



18.271 
18.325 
22.075 
21.972 
23.228 
24.445 
25.109 
25.489 
25.248 
25.899 
26.604 
25.472 
24.974 
25.458 
23.664 
22.739 
21.311 
20.198 
23.065 
23.212 
23.157 
23.476 
23.482 
23.913 
22.765 
23.650 
24.842 
25.020 
25.800 
27.176 
27.973 
27.842 
28.726 
28.891 
29.270 
27.266 
27.956 
26.569 
26.593 
25.991 
25.917 
26.837 
26.462 
25.871 
26.274 
24.788 
24.013 
22.716 
21.608 
19.892 
20.034 
24.834 
25.116 
25.209 



42.515 
43.548 
40.527 
41.286 
40.306 
40.962 
40.353 
41.087 
39.024 
38.326 
37.089 
35.770 
37.870 
37.319 
38.084 
37.623 
38.004 
37.299 
38.123 
37.334 
39.439 
40.016 
41.547 
42.230 
41.751 
42.321 
39.527 
39.076 
39.638 
39.234 
39.303 
40.628 
40.720 
39.706 
41.818 
37.827 
37.620 
36.866 
35.497 
34.518 
33.123 
34.471 
33.342 
35.371 
34.593 
36.130 
36.081 
36.888 
36. 198 
36.817 
38.503 
36.579 
35.814 
37 .859 



45.971 1.00 53.26 6 
47.038 1-0O 59.61 7 
43.302 1.00 35.25 6 
42.334 1.00 33.95 8 
43.928 1.00 35.79 7 
43.482 1.00 34.59 6 
42.259 1.00 33.80 6 
41.344 1.00 31.59 8 
42.256 1.00 31.15 7 
41.174 1.00 29.04 6 
41.704 1.00 27.59 6 
42.071 1.00 30.50 16 
40.062 1.00 30.59 6 
39.077 1.00 33.77 8 
40.195 1.00 28.46 7 
39.168 1.00 30.10 6 
39.490 1.00 33.43 6 
38.307 1.00 35.20 16 
37.788 1.00 27.72 6 
36.865 1.00 27.69 8 
37.639 1.00 26.15 7 
36.342 1.00 26.06 6 
36.419 1.00 25.32 6 
35.109 1.00 24.08 6 
33.762 1.00 27.71 16 
32.270 1.00 28.50 6 
35.908 1.00 25.94 6 
34.788 1.00 28.09 8 
36.826 1.00 25.39 7 
36.589 1.00 27.03 6 
37.900 1.00 24.39 6 
38.668 1.00 26.00 6 
39.870 1.00 23.95 6 
40.588 1.00 19.72 8 
40.159 1.00 26.51 8 
35.997 1.00 27.51 6 
35.014 1.00 29.67 8 
36.602 1.00 26.82 7 
36.112 1.00 25.71 6 
37.123 1.00 23.35 6 
36.533 1.00 20.27 6 
38.398 1.00 20.88 6 
39.341 1.00 18.15 6 
34.791 1.00 27.91 6 
33.934 1.00 28.96 8 
34.633 1.00 27.66 7 
33.395 1.00 30.18 6 
33.508 1.00 36.89 6 
34.305 1.00 37.95 6 
34.055 1.00 42.38 16 
34.740 1.00 40.68 6 
32.222 1.00 27.43 6 
31.308 1.00 28.61 8 
32.253 1.00 24.88 7 
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NH1 ARG 
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CGI VAL 
CG2 VAL 
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NH1 ARG 
NH2 ARG 
C ARG 
O ARG 
N TYR 
CA TYR 
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A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
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A 
A 
A 
A 
A 
A 
A 
A 
A 

ALA A 318 
ALA A 318 
VAL A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
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A 
A 
A 
A 
A 
A 
A 
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ARG 
ARG 
ALA 
ALA 
ALA 
ALA 
ALA 
ALA 
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315 
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316 
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320 
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321 

321 



69 

26.005 
26.354 
26.956 
27.275 
27.675 
27.905 
29.099 
29.685 
30.675 
31.866 
31.125 
28. 700 
29.304 
27.678 
27.151 
25.915 
26. 188 
24.934 
25.245 
24.341 
23.084 
24.701 
26.774 
26.737 
26.484 
26.094 
25.232 
27.323 
27.398 
28.286 
29.515 
30.434 
30. 181 
30.600 
30.255 
30.880 
30.703 
30.895 
29.353 
30.215 
30.640 
29.176 
28.415 
27.031 
26.863 
25.637 
25.258 
24.331 
23.667 
23. 964 
28.292 
27.251 
29.352 
29.366 
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38.472 
39.914 
39.972 
37.679 
37.629 
37.048 
36.261 
35.593 
34 .479 
35.040 
33.789 
35. 180 
35.036 
34 .426 
33.312 
32.752 
32.190 
31.526 
30.376 
29.468 
29. 614 
28.416 
33. 660 
32.792 
34.936 
35. 411 
36. 666 
35.714 
35.342 
36.396 
36.760 
37.452 
35^502 
35T~447 
34.491 
33.198 
32.210 
32.891 
31.552 
32.608 
31.575 
33.2.84 
32.822 
32.458 
30.991 
30.642 
29.256 
28.625 
29.291 
27.385 
33.825 
33.909 
34 .583 
35.555 



31.197 
31.581 
32.858 
30.851 
29. 690 
31.845 
31.563 
32.816 
32.505 
31.756 
33.765 
30.597 
29.556 
30. 982 
30.216 
30.928 
32.336 
32. 901 
33.721 
34 .054 
33.639 
34 .776 
28 .794 
27.931 
28.571 
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-29.432 69.801 7.535 1.00 43.72 

-28.589 70.413 5.546 1.00 44.62 

-28.983 71.812 5.658 1.00 45.09 

-28.354 72.608 4.510 1.00 44.66 

-26.847 72.735 4.539 1.00 51.06 

-26.226 71.367 4.640 1.00 48.58 

-26.364 73.450 3.299 1.00 45.18 

-30.508 71.965 5.652 1.00 48.06 

-31.211 71.244 4.959 1.00 49.33 

-30.988 72.911 6.458 1.00 52.20 

-32.407 73.214 6.588 1.00 54.41 

-32.870 74.013 5.370 1.00 54.94 

-33.687 75.220 5.749 1.00 60.35 

-33.182 76.130 6.430 1.00 61.84 

-34.935 75.242 5.324 1.00 65.92 

-33.251 71.959 6.731 1.00 58.00 

-34.464 72.000 6.579 1.00 60.17 

-32.596 70.846 7.054 1.00 58.45 

-33.295 69.587 7.235 1.00 58.55 

-33.909 69.004 5.984 1.00 59.79 

-34.609 68.000 6.065 1.00 61.32 

-33.639 69.628 4.838 1.00 60.28 

-34.196 69.182 3.571 1.00 59.13 

-34.966 70.323 2.885 1.00 62.40 

-36.099 70.963 3.690 1.00 75.69 

-36.720 72.135 2.998 1.00 80.41 

-35.984 73.081 2.618 1.00 79.98 

-37.966 72.158 2.830 1.00 83.81 

-33.110 68.722 2.624 1.00 57.18 

-33.236 67.689 1.974 1.00 57.50 
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5 ATOM 1451 CD PRO B 447 -22.837 85.637 -8.886 1.00 72.88 6 

ATOM 1452 CA PRO B 447 -23.461 87.997 -8.653 1.00 74.22 6 

ATOM 1453 CB PRO B 447 -24.399 87.338 -9.659 1.00 72.98 6 

ATOM 1454 CG PRO B 447 -23.981 85.934 -9.776 1.00 74.77 6 

ATOM 1455 C PRO B 447 -24.203 88.519 -7.451 1.00 75.94 6 

10 ATOM 1456 O PRO B 447 -24.601 87.749 -6.611 1.00 76.67 8 

ATOM 1457 N THR B 448 -24-390 89.828 -7.373 1.00 76.91 7 

ATOM 1458 CA THR B 448 -25.134 90.436 -6.268 1.00 78.24 6 

ATOM 1459 CB THR B 448 -24.883 91.948 -6.276 1.00 81.33 6 

ATOM 1460 OG1 THR B 448 -25.474 92.525 -7.451 1.00 84.46 8 

15 ATOM 1461 CG2 THR B 448 -23.394 92.234 -6.269 1.00 83.51 6 

ATOM 1462 C THR B 448 -26.594 90.160 -6.619 1.00 77.42 6 

ATOM 1463 O THR B 448 -27.512 90.649 -5.982 1.00 77.65 8 

ATOM 1464 N GLU B 449 -26.759 89.400 -7.697 1.00 76.29 7 

ATOM 1465 CA GLU B 449 -28.051 89.017 -8.211 1.00 75.03 6 

20 ATOM 1466 CB GLU B 449 -27.923 88.915 -9.719 1.00 74.62 6 

ATOM 1467 CG GLU B 449 -28.823 87.966 -10.343 1.00 40.00 6 

ATOM 1468 CD GLU B 449 -28.522 87.831 -11.756 1.00 40.00 6 

ATOM 1469 OE1 GLU B 449 -27.366 88.072 -12.190 1.00 40.00 8 

ATOM 1470 OE2 GLU B 449 -29.449 87.438 -12.492 1.00 40.00 8 

25 ATOM 1471 C GLU B 449 -28.448 87.660 -7.609 1.00 73.49 6 

ATOM 1472 O GLU B 449 -29.479 87.092 -7.953 1.00 70.24 8 

ATOM 1473 N LEU B 450 -27.624 87.158 -6.695 1.00 70.80 7 

ATOM 1474 CA LEU B 450 -27.879 85.879 -6.058 1.00 68.82 6 

ATOM 1475 CB LEU B 450 -26.772 84.887 -6.447 1.00- 71.91 6 

30 ATOM 1476 CG LEU B 450 -26.612 84.503 -7.900 1.00 76.62 6 

ATOM 1477 CD1 LEU B 450 -25.396 83.637 -8.059 1.00 77.95 6 

ATOM 1478 CD2 LEU B 4'50 -27.849 83.775 -8.351 1.00 76.46 6 

ATOM 1479 C LEU B 450 -27.941 86.012 -4.536 1.00 66.22 6 

ATOM 1480 O LEU B 450 -28.251 85.039 -3.849 1.00 66.01 8 

35 ATOM 1481 N PHE B 451 -27.666 87.215 -4.012 1.0061.96 7 

ATOM 1482 CA PHE B 451 -27.635 87.494 -2.585 1.00 58.44 6 

ATOM 1483 CB PHE B 451 -26.579 88.573 -2.263 1.00 61.34 6 

ATOM 1484 CG PHE B 451 -25.153 88.078 -2.413 1.00 63.02 6 

ATOM 1485 CD1 PHE B 451 -24.675 87.587 -3.626 1 . 00 62 . 92 6 

40 ATOM 1486 CD2 PHE B 451 -24.283 88.173 -1.346 1.00 63.07 6 

ATOM 1487 CE1 PHE B 451 -23.327 87.217 -3.757 1.00 65.12 6 

ATOM 1488 CE2 PHE B 451 -22.939 87.806 -1.472 1.00 64.66 6 

ATOM 1489 CZ PHE B 451 -22.459 87.335 -2.686 1.00 67.12 6 

ATOM 1490 C PHE B 451 -28.931 87.994 -1.962 1.00 56.41 6 

45 ATOM 1491 O PHE B 451 -29.207 89.214 -1.908 1.00 56.56 8 

ATOM 1492 N PRO B 452 -29.791 87.072 -1.473 1.00 53.28 7 

ATOM 1493 CD PRO B 452 -29.767 85.611 -1.494 1.00 50.46 6 

ATOM 1494 CA PRO B 452 -31.037 87.598 -0.843 1.00 50.26 6 

ATOM 1495 CB PRO B 452 -31.746 86.375 -0.301 1.00 49.19 6 

50 ATOM 1496 CG PRO B 452 -31.024 85.197 -0.842 1.00 45.89 6 

ATOM 1497 C PRO B 452 -30.636 88.567 0.251 1.00 49.62 6 

ATOM 1498 O PRO B 452 -29.628 88.401 0.906 1.00 52.35 8 

ATOM 1499 N PRO B 453 -31.494 89.539 0.535 1.00 51.50 7 

ATOM 1500 CD PRO B 453 -32.853 89.644 0.022 1.00 49.66 6 

55 ATOM 1501 CA PRO B 453 -31.184 90.573 1.530 1.00 50.89 6 

ATOM 1502 CB PRO B 453 -32.422 91.401 1.625 1.00 51.49 6 

ATOM 1503 CG PRO B 453 -33.378 90.827 0.724 1.00 50.82 6 

ATOM 1504 C PRO B 453 -30.829 90.039 2.906 1.00 50.99 6 
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Appendix 2 

Atomic Coordinates for Human ERa Complexed with DES, and a GRIP1 NR-box 2 Peptide 

CRYSTl 54.094 82.217 58.041 90.00 111.33 90.00 P 21 2 
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10.724 -2.749-11.717 1.00 23.97 
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Atomic Coordinates for Human ERct Complexed With OHT 
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WHAT IS CLAIMED IS : 

55 1 . A method of identifying a compound that modulates coactivator binding to a nuclear 

receptor, said method comprising: 

modeling test compounds that fit spacially into a nuclear receptor coactivator 
binding site of interest using an atomic structural model of a nuclear receptor coactivator binding 
site or portion thereof. 
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5 screening said test compounds in an assay characterized by binding of a test 

compound to a nuclear receptor coactivator binding site, and 

identifying a test compound that modulates coactivator binding to said nuclear 

receptor. 

10 2. The method of claim 1 , wherein said atomic structural model comprises atomic 

coordinates of amino acid residues corresponding to residues of human thyroid receptor selected 
from the group consisting of Val284, Phe293, Ile302, Leu305, and Leu454. 



15 



20 
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35 



3. The method of claim 1 , wherein said atom.c structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human thyroid receptor selected 
from the group consisting of Val284, Lys288. Ile302. Lys306. Leu454 and Glu457. 

4. The method of claim 1, wherein said atomic structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human thyroid receptor helix 3 
residues Ilc280 t Thx281, Val283, Val284, Ala287, and Lys288, helix 4 residue Phe293 helix 5 ^ 
residues Gln3Ql, Ile302, Leu305, Lys306, helix 6 residue Cys309, and helix 12 residues Pro453, 
Leu454, Glu457, Val458 and Phe459. 

5. The method of claim 1, wherein said nuclear receptor coactivator bindine site 
comprises amino acid residues corresponding to residues of human thyroid receptor selected from 
the group consisting of helix 3 residues Ile280. Thr281. Val283. Val284. Ala287. and Lvs288 heliv 
4 residue Phe293, helix 5 residues Gln301. Ile302, Leu305. Lys306 ? helix 6 residue Cvs309. and 
hehx42 residues Pro453. Leu454, Glu457, Val458 and Phe459. 
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The method of claim 5, wherein said amino acid residues corresponding to residues 
of human thyroid receptor comprise Val284, Phe293. Ile302. Leu305. and Leu454. 

7. The method of claim 5, wherein said amino acid residues corresponding to residues 
of human thyroid receptor comprise Val284, Lys288. Ile302, Lys306. Leu454 and Glu457. 

8. The method of claim 1 . wherein said nuclear receptor coactivator bindine site 
comprises amino acid residues corresponding to residues of human thvroid receptor of helix 3 
residues Ue280. Thr28L Val283. Val284. Ala287. and Lys288. helix 4 residue Phe-93 helix 5 
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5 residues Gln301. Ile302. Leu305. Lys306. helix 6 residue Cys309. and helix 12 residues Pro453. 
Leu454, Glu457. Val458 and Phe459. 

9. The method of any one of claims 5 through 8. wherein said nuclear receptor is 
selected from the group consisting of TR. RAR f RXR ? PPAR ; VDR. ER. GR, PR, MR. and AR. 

10 

10. The method of claim 1 . wherein said screening is in vitro. 

1 1 . The method of claim 10 r wherein said screening is high throughput screening. 
15 12. The method of claim U wherein said assay is a biological assay. 

13. The method of claim 1 . wherein said test compound is from a library of compounds. 

14. The method of claim 1 ? wherein said test compound is an agonist or antagonist of 
20 coactivator binding. 

15. The method of claim 14, wherein said test compound is a small organic molecule, a 
peptide, or peptidomimetic. 

25 16. The method of claim 15 ? wherein said compound is a peptide comprising a NR-box 

amino acid sequence, or derivative thereof. 

17. A method for identifying an agonist or antagonist of coactivator binding to a nuclear 
receptor, said method comprising the steps of: 
30 providing the atomic coordinates of a nuclear receptor coactivator binding site or 

portion thereof to a computerized modeling system; 

modeling compounds which fit spacially into the nuclear receptor coactivator 
binding site: and 

identifying in an assay for nuclear receptor activity a compound that increases or 
35 decreases the activity of said nuclear receptor by binding the coactivator binding site of said nuclear 
receptor, whereby an agonist or antagonist of coactivator binding is identified. 
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18. A machine-readable daia storage medium, comprising a data storage material 
encoded with machine readable data which, when using a machine programmed with instructions 
for using said data, is capable of displaying a graphical three-dimensional representation of a 
molecular complex of a compound bound to a nuclear receptor coactivator binding site comprisine 
structure coordinates of amino acids corresponding to human thyroid receptor amino acids selected 
from the group consisting of helix 3 residues Ile280, Thr281. Val283, Val284. Ala287. and Lys288. 
helix 4 residue Phe293. helix 5 residues Gln301, Ile302, Leu305, Lys306, helix 6 residue Cys309. 
and helix 12 residues Pro453, Leu454. Glu457, Val458 and Phe459, or a homologue of said 
molecular complex, wherein said homologue comprises a coactivator binding site that has a root 
mean square deviation from the backbone atoms of said amino acids of not more than 1.5A. 

1 9. The machine readable storage medium of claim 1 8. wherein said nuclear receptor is 
a thyroid receptor. 

20. The machine readable storage medium of claim 1 9, wherein said thyroid receptor is 

20 human. 

21 . The machine readable storage medium of claim 20, wherein said molecule is peptide. 

22. The machine readable storage medium of claim 2 1 . wherein said peptide comprises a 
25 NR-box amino acid sequence, or derivative thereof. 

23. The machine-readable data storage medium according to claim 18. wherein said 
molecular complex is defined by the set of structure coordinates depicted in Appendix 1. or a 
homologue of said molecular complex, said homologue having a root mean square deviation from 

30 the backbone atoms of said amino acids of not more than 1 .5A. 

24. A machine-readable data storage medium comprising a data storage material 
encoded with a first set of machine readable data which, when combined with a second set of 
machine readable data, using a machine programmed with instructions for using said first set of data 

35 and said second set of data, can determine at least a portion of the structure coordinates 
corresponding to the second set of machine readable data, wherein: said first set of data comprises a 
Fourier transform of at least a portion of the structural coordinates selected from the group 
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5 consisting of coordinates depicted in Appendix 1; and said second set of data comprises an X-ray 
diffraction pattern of a molecule or molecular complex. 

25. A cocrystal of a nuclear receptor comprising a molecule bound to the coactivator 
binding site of said nuclear receptor. 

10 

26. The cocrystal of claim 25 ? wherein said nuclear receptor is a thyroid receptor. 

27. The cocrystal of claim 26, wherein said thyroid receptor is human. 

15 28 ■ The cocrystal of claim 27, wherein said molecule is peptide. 

29. The cocrystal of claim 28. wherein said peptide comprises a NR-box amino acid 
sequence or derivative thereof. 



20 



30. A compound identified according to the method of claim 1. 
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AEGHSRLHDSKGQTKLLQLI/TTKSEQMEPSPI.PS 
AENQRGPLESKGHKKLLQLLTCSSKDRGHSSLTN 
AENQRGPLESKGHKKLLQLLTCSSDDRGHSSLTN 
SETPRGPLESKGHKKLLQLLTCSSEDRGHSSLTN 
SEGDSKY- - SQTSHKLVQLLTTTAEQQLRHADID 
. - S. . . .KLnQLLT 

PGSTHGTSI*KEKHKILHRI.I.QDSSSPVDI*AKI*TA 
SGSTHGTSXiKEKHKIIiHRIiliQDSSSPVDLAKLTA 
PGSTHGTSIiKEKHKILHRLLQDSSSPVDLAKIiTA 
TSNMHGSLLQEKHRILHKLLQNGNSPAEVAKITA 
TSNMHGSLLQEKHRILHKIiLQNGNSPAEVAKITA 
TSNVHGSLLQKKHRILHKLLQNGNSPAEVAKITA 
TCPSSHSSLTERHKILHRLLQB . GSPSDITTLSV 
I* . E+H+ILH+LLQ SP . -h. .n. • 

EPASPKKKE NALLRYI*tiDKDDTKD IGLPEIT 

EPVSPKKKB NALLRYIXDKDDTKDIGLPBIT 

EPASPKKKB NALLRYLLDKDDTKD IGLP SIT 

BQLSPKKKB- -NNALLRYLLDRDDPSDVI*AKKXiQ 
BQLSPKKKB- -NNALLRYLLDRDDPSDALSKELQ 
BQLSPKKKB- - NNALLiRYLLDRDDP SD ALSKEL.Q 
EIiDAAKKKESKDHQLiLiRYLLiDKDEKDLtRSTPNIiC 
B KKKB LLRYLLD+D - h. 



Overall Consensus (SEQ ID NO: 1) 



. LXXLIi. 
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FIGURE 7 



bHIiH 



PAS 



Nuclear 
Receptor 
interaction 



CBP & p300 
interaction 



Transcriptional 
activation domain 




NR-boxl 

residues 15-21 
SEQ ID NO: 5 



NR-boxl 

residues 15-21 
SEQ ID NO; 6 



NR box3 

residues 15-21 
SEQ ID NO: 7 



NRb 1,2,3 
NRb 1,2 
NRb 1,3 
NRb 1 



KLLQLLT . 

KLLQLLT . 
KLLQLLT . 



.ILHRLLQ. 
.ILHRLLQ. 
.AAHRAAQ. 
•AAKRAAQ. 



.LLRYLLD 
.AARAAAD 
. LLRYLLD 
.AARAAAD 




BNSDOQID: <WO. 



996001 4A2_I_> 



WO 99/60014 



PCT/US99/06899 



8/19 



FIGURE 8 

□ GST 

GST 563 767 ■ NRb 1,2,3 
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FIGURE 10 



residues 1 1-23 
residues 7-24 

Of 

SEQ ID NO: 6 



LLRYLL 
FKKKKHA LIAXXX 




DKDDTTCD 
DKDPTDK 



i 10 100 1009 
Peptide (uM) 



.01 



1 1 10 100 
Peptide (uM) 



residues 15-20 
residues U-27 
residues 8-27 
Of 

SEQ ID NO: 7 



residues 8027 of SEQ ID NO: 7 
residues 1 1-27 of SEQ ID NO: 6 
SEQ ID NO: 29 



PKKKBNA. LLRYLL D1CDDTJCD 
TSLKBKHK gSSSBBBJ QDS3 



TSLKBKHK LLRYLL QDSS 




10 100 1000 

(uM) 



BNSDOCID: <WO 996001 4A2J_> 



WO 99/60014 



PCT/US99/06899 _ 



11/19 




BNSDOCID: <WO 996001 4A2_I_> 



WO 99/60014 



PCT/US99/06899 




12/19 




FIG. 12 
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FIG. 13 
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FIG. 14 
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FIG. 15 
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FIG. 16 
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5 SEQUENCE LISTING 

<110> Baxter, John D. 

Darimont , Beatrice 
Feng, Weijun 
10 Fletterick, Robert J. 

Kushner, Peter J. 
Wagner, Richard L. 
West, Brian L. 
Yamamoto, Keith R. 

15 

<120> METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR 
COACTIVATOR BINDING 

<130> UCAL-253/01US 

<140> 
<141> 

<150> US 60/079, 956 
25 <151> 1998-03-30 

<160> 51 

<170> Patentln Ver. 2.0 

<210> 1 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Leu Xaa Xaa Leu Leu 
1 5 



30 



35 



40 



45 



50 



55 



60 



65 



<210> 2 

<211> 6 

<212> PRT 

<213> Homo sapiens 

<400> 2 

lie Leu Xaa Xaa Leu Leu 
1 5 



<210> 3 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Phe Xaa Xaa Leu Trp 
1 5 



<210> 4 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Phe Xaa Xaa Ala Leu 
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<210> 5 
<2ll> 34 
10 <212> PRT 

<213> Homo sapiens 

<400> 5 

is Ala Glu Gly His Ser Arg Leu His Asp Ser Lys Gly Gin Thr Lys Leu 

5 10 15 . 

Leu Gin Leu Leu Thr Thr Lys Ser Glu Gin Met Glu Pro Ser Pro Leu 
20 25 30 

20 Ala Ser 



<210> 6 
25 <211> 34 
<212> PRT 

<213> Homo sapiens 

<220> 
30 <221> MUTAGEN 
<222> (15) 
<223> He Ala 

<220> 
35 <221> MUTAGEN 
<222> (16) 
<223> Leu --> Ala 

<220> 

40 <221> MUTAGEN 

<222> (19) 

<223> Leu - - > Ala 

<220> 
45 <221> MUTAGEN 
<222> (20) 
<223> Leu Ala 

<220> 
50 <221> MUTAGEN 

<222> (16) . . (20) 

<223> Leu (16) Ala; Leu(20) Ala 

<220> 

55 <221> MUTAGEN 

<222> (15) . . (16) 

<223> LeuLeu --> AlaAla 

<220> 
60 <221> MUTAGEN 

<222> (17) . . (18) 

<223> HisArg --> AlaAla 

<220> 
65 <221> MUTAGEN 

<222> (19) . . (20) 

<223> LeuLeu --> AlaAla 
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15 



20 



25 



35 



45 
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<220> 

<221> MUTAGEN 

<222> (15) 

<223> He --> Phe 

<220> 

<221> MUTAGEN 

<222> (16) 

<223> leu --> Phe 

<220> 

<221> MUTAGEN 

<222> (19) 

<223> Leu --> Phe 

<220> 

<221> MUTAGEN 

<222> (20) 

<223> Leu Phe 

<400> 6 

Pro Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys He Leu 
15 10 15 



30 His Arg Leu Leu Gin Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu 

20 25 30 



Thr Ala 



<210> 7 
<211> 31 
<212> PRT 
40 <213> Homo sapiens 

<400> 7 

Glu Pro Ala Ser Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu 
15 10 15 



Leu Asp Lys Asp Asp Thr Lys Asp He Gly Leu Pro Glu He Thr 
20 25 30 



50 <210> 8 

<211> 34 

<212> PRT 

<213> Homo sapiens 

55 <400> 8 

Ala Asp Gly Gin Ser Arg Leu His Asp Ser Lys Gly Gin Thr Lys Leu 
15 10 15 

Leu Gin Leu Leu Thr Thr Lys Ser Glu Gin Met Glu Pro Ser Pro Leu 
60 20 25 30 

Ala Ser 



65 

<210> 9 
<211> 34 



BNSDOCID: <WO 996001 4A2_L> 
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PCT/US99/06899 

4 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Ser Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys He Leu 
1 . 5 10 15 

His Arg Leu Leu Gin Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu 
20 25 30 

Thr Ala 



<210> 10 

<211> 31 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Glu Pro Val Ser Pro Lys Lys Lys 
1 5 

Leu Asp Lys Asp Asp Thr Lys Asp 
20 



Glu Asn Ala Leu Leu Arg Tyr Leu 
10 15 

lie Gly Leu Pro Glu lie Thr 
25 30 



<210> 11 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 11 

Ala Glu Gly His Ser Arg Leu His 

1 5' 

Leu Gin Leu Leu Thr Thr Lys Ser 
20 

Pro Ser 



Asp ser Lys Gly Gin Thr Lys Leu 
10 15 

Glu Gin Met Glu Pro Ser Pro Leu 
25 3 o 



<210> 12 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<:400> 12 

Pro Gly Ser Thr His Gly Thr Ser 
1 5 

His Arg Leu Leu Gin Asp Ser Ser 
20 

Thr Ala 



Leu Lys Glu Lys His Lys lie Leu 
10 15 

Ser Pro Val Asp Leu Ala Lys Leu 
25 30 



<210> 13 

<211> 31 

<212> PRT 

<213> Homo sapiens 



.99600 14A2J_> 



WO 99/60014 



5 
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<400> 13 

Glu Pro Ala Ser Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu 
15 10 15 

10 Leu Asp Lys Asp Asp Thr Lys Asp lie Gly Leu Pro Ser lie Thr 

20 25 30 



<210> 14 
15 <211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 14 

20 Ala Glu Asn Gin Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu 
15 10 15 

Leu Gin Leu Leu Thr Cys Ser Ser Glu Asp Arg Gly His Ser Ser Leu 
20 25 30 



25 



Thr Asn 



30 <210> 15 
<211> 34 
<212> PRT 

<213> Homo sapiens 
35 <400> 15 

Thr Ser Asn Met His Gly Ser Leu Leu Gin Glu Lys His Arg lie Leu 
15 10 15 

His Lys Leu Leu Gin Asn Gly Asn Ser Pro Ala Glu Val Ala Lys lie 
40 20 25 30 

Thr Ala 



45 

<210> 16 
<211> 32 
<212> PRT 
<213> Homo sapiens 

<400> 16 

Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr 
15 10 15 

55 Leu Leu Asp Arg Asp Asp Pro Ser Asp Val Leu Ala Lys Lys Leu Gin 

20 25 30 



50 



60 

<210> 17 
<211> 34 
<212> PRT 
65 <213> Homo sapiens 

<400> 17 



BNSDOCtO: <WO. 



996001 4A2_L> 
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6 

Ala Glu Asn Gin Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu 
1 5 10 is 

Leu Gin Leu Leu Thr Cys Ser Ser Asp Asp Arg Gly His Ser Ser Leu 
20 25 30 

Thr Asn 



<210> 18 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Thr Ser Asn Met His Gly Ser Leu 
1 5 

His Lys Leu Leu Gin Asn Gly Asn 
20 

Thr Ala 



Leu Gin Glu Lys His Arg lie Leu 
10 15 

Ser Pro Ala Glu Val Ala Lys lie 
25 30 



<210> 19 
<211> 32 
<212> PRT 

<213> Homo sapiens 



<400> 19 

Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr 
1 5 io 15 

Leu Leu Asp Arg Asp Asp Pro Ser Asp Ala Leu Ser Lys Glu Leu Gin 
20 25 30 



<210> 20 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 20 

Ser Glu Thr Pro Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu 
15 10 15 

Leu Gin Leu Leu Thr Cys Ser Ser Glu Asp Arg Gly His Ser Ser Leu 
20 25 30 

Thr Asn 



<210> 21 
<211> 34 
<212> PRT 

<213> Homo sapiens 



.996001 4A2_I_> 
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5 <400> 21 

Thr Ser Asn Val His Gly Ser Leu Leu Gin Glu Lys His Arg lie Leu 
1 . 5 10 15 

His Lys Leu Leu Gin Asn Gly Asn Ser Pro Ala Glu Val Ala Lys lie 
10 20 25 ^ 30 



15 



20 



Thr Ala 



<210> 22 
<211> 32 
<212> PRT 

<213> Homo sapiens 



<400> 22 

Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr 
15 10 15 

25 Leu Leu Asp Arg Asp Asp Pro Ser Asp Ala Leu Ser Lys Glu Leu Gin 

20 25 30 



30 

<210> 23 
<211> 32 
<212> PRT 
35 <213> Homo sapiens 

<400> 23 

Ser Glu Gly Asp Ser Lys Tyr Ser Gin Thr Ser His Lys Leu Val Gin 
15 10 15 



Leu Leu Thr Thr Thr Ala Glu Gin Gin Leu Arg His Ala Asp lie Asp 
20 25 30 



40 



45 



<210> 24 

<211> 33 

50 <212> PRT 

<213> Homo sapiens 

<400> 24 

Thr Cys Pro Ser Ser His Ser Ser Leu Thr Glu Arg His Lys lie Leu 
55 1 5 10 15 

His Arg Leu Leu Gin Glu Gly Ser Pro Ser Asp lie Thr Thr Leu Ser 
20 25 30 

60 Val X 



<210> 25 

65 <211> 34 

<212> PRT 

<213> Homo sapiens 
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<400> 25 

Glu Leu Asp Ala Ala Lys Lys Lys Glu Ser Lys Asp His Gin Leu Leu 
1 5 10 15 

10 Arg Tyr Leu Leu Asp Lys Asp Glu Lys Asp Leu Arg Ser Thr Pro Asn 

20 25- 30 



15 



40 



45 



Leu Cys 



<210> 26 
<211> 34 
<212> PRT 
20 <213> Homo sapiens 

<220> 

<221> SITE 
<222> (2) 

25 <223> negatively charged amino acid 
<220> 

<221> SITE 
<222> (17) 
30 <223> hydrophobic amino acid 

<220> 

<221> SITE 

<222> (25) 

35 <223> negatively charged amino acid 

<400> 26 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Lys Leu 
1 5 10 15 



Xaa Gin Leu Leu Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa 



<210> 27 
<211> 34 
50 <212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
55 <222> (12) 

<223> positively charged amino acid 

<220> 

<221> SITE 
60 <222> (14) 

<223> positively charged amino acid 

<220> 

<221> SITE 

65 <222> (18) 

<223> positively charged amino acid 
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<220> 

<221> SITE 
<222> (28) 

<223> negatively charged amino acid 
<220> 

<221> SITE 
<222> (29) 

<223> hydrophobic amino acid 
<220> 

<221> SITE 
<222> (32) 

<223> hydrophobic amino acid 
<400> 27 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Glu Xaa His Xaa He Leu 
15 10 15 

His Xaa Leu Leu Gin Xaa Xaa Xaa Ser Pro Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa 



<210> 28 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (22) 

<223> positively charged amino acid 
<220> 

<221> SITE 
<222> (24) 

<223> negatively charged amino acid 
<220> 

<221> SITE 
<222> (33) 

<223> hydrophobic amino acid 
<400> 28 

Glu Xaa Xaa Xaa Xaa Lys Lys Lys Glu Xaa Xaa Xaa Xaa Xaa Leu Leu 
1 5 10 15 

Arg Tyr Leu Leu Asp Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

Xaa Xaa 



<210> 29 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 29 
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5 Thr Ser Leu Lys Glu Lys His Lys Leu Leu Arg Tyr Leu Leu Gin Asp 
1 5 10 15 

Ser Ser 

10 

<210> 30 
<211> 33 
<212> PRT 
15 <213> Homo sapiens 

<220> 

<221> MUTAGEN 
<222> (5) 
20 <223> Thr --> Arg (T281R) 

<220> 

<221> MUTAGEN 
<222> (8) 
25 <223> Val --> Arg (V284R) 

<220> 

<221> MUTAGEN 
<222> (9) 
30 <223> Asp --> Ala (D285A) 

<220> 

<221> MUTAGEN 
<222> (12) 
35 <223> Lys --> Ala (K288A) 

<220> 

<221> MUTAGEN 

<222> (22) 

40 <223> Cys Arg (C298R) 

<220> 

<221> MUTAGEN 
<222> (26) 
45 <223> He --> Arg (I302R) 

<220> 

<221> MUTAGEN 
<222> (30) 
50 <223> Lys Ala (K306A) 

<40O> 30 

Thr Pro Ala He Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
1 5 10 .15 

55 

Phe Cys Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly Cys 
20 25 30 

Cys 

60 



<210> 31 
<211> 12 
65 <212> PRT 

<213> Homo sapiens 
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5 <220> 

<221> MUTAGEN 

<222> (5) 

<223> Leu --> Arg (L454R) 

10 <220> 

<221> MUTAGEN 
<222> (7) 

<223> Leu --> Arg (L456R) 

15 <220> 

<221> MUTAGEN 
<222> (8) 

<223> Glu Lys (E457K) 

20 <400> 31 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
1 5 10 

25 <210> 32 
<211> 33 
<212> PRT 

<213> Homo sapiens 
30 <400> 32 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
1 5 10 



35 



15 



Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly Cys 
20 25 30 

Cys 



40 



45 



50 



55 



60 



65 



<210> 33 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 33 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 



5 



10 



<210> 34 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Thr Lys Cys He He Lys He Val Glu Phe Ala Lys Arg Leu Pro Gly 



10 15 



Phe Thr Gly Leu Ser He Ala Asp Gin He Thr Leu Leu Lys Ala Ala 
20 25 30 



Cys 



<210> 35 
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<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 35 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
1 5 10 



<210> 36 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Asp Lys Gin Leu Phe Thr Leu Val 
1 5 

Phe Ser Glu Leu Pro Leu Asp Asp 
20 

Trp 



Glu Trp Ala Lys Arg lie Pro His 
10 15 

Gin Val lie Leu Leu Lys Ala Gly 
25 30 



<210> 37 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 37 

Pro lie Asp Thr Phe Leu Met Glu Met Leu Glu Ala 
15 10 



<210> 38 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 38 

Val Glu Ala Val Gin Glu lie Thr Glu Tyr Ala Lys Asn lie Pro Gly 
1 5 10 15 

Phe lie Asn Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly 
20 25 30 

Val 



<210> 39 

<211> 12 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys Asp 
15 10 



<210> 40 
<211> 33 



996001 4A2 I > 
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13 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Ser Tyr Ser He Gin Lys Val He Gly Phe Ala Lys Met He Pro Gly 



10 15 

Phe Arg Asp Leu Thr Ser Glu Asp Gin lie Val Leu Leu Lys Ser Ser 
20 25 30 

15 Ala 



<210> 41 
20 <211> 12 
<212> PRT 

<213> Homo sapiens 

<400> 41 
25 Lys Leu Thr Pro Leu Val 
1- 5 



Leu Glu Val Phe Gly Asn 
10 



<210> 42 
30 <211> 33 
<212> PRT 

<213> Homo sapiens 

<220> 
35 <221> MUTAGEN 
<222> (12) 

<223> Lys --> Ala (K362A) 

<220> 
40 <221> MUTAGEN 
<222> (26) 

<223> Val > Arg (V376R) 



45 



<400> 42 



Asp Arg Glu Leu Val His Met He Asn Trp Ala Lys Arg Val Pro Gly 



5 10 15 



Phe val Asp Leu Thr Leu His Asp Gin Val His Leu Leu Glu Cys Ala 
50 20 25 3 0 

Trp 



55 <210> 43 
<211> 12 
<212> PRT 

<213> Homo sapiens 

60 <220> 

<221> MUTAGEN 
<222> (8) 

<223> Glu --> Lys (E542K) 
65 <400> 43 

Pro Leu Tyr Asp Leu Leu Leu Glu Met Leu Asp Ala 
1 5 xo 
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<210> 44 
<211> 33 
<212> PRT 
10 <213> Homo sapiens 



15 



20 



<400> 44 

Gly Arg Gin Val lie Ala Ala Val Lys Trp Ala Lys Ala He Pro Gly 
15 10 15 

Phe Arg Asn Leu His Leu Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser 
20 25 30 

Trp 



<210> 45 
<211> 12 
25 <212> PRT 

<213> Homo sapiens 

<400> 45 

Glu Phe Pro Glu Met Leu Ala Glu He He Thr Asn 
30 1 5 10 



<210> 46 
<211> 33 
35 <212> PRT 

<213> Homo sapiens 

<400> 46 

Glu Arg Gin Leu Leu Ser Val Val Lys Trp Ser Lys Ser Leu Pro Gly 
40 1 5 10 15 

Phe Arg Asn Leu His He Asp Asp Gin He Thr Leu He Gin Tyr Ser 
20 25 30 

45 Trp 



<210> 47 
50 <211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 47 

55 Glu Phe Pro Glu Met Met Ser Glu Val He Ala Ala 
15 10 



<210> 48 
60 <211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 48 

65 Gly Lys Gin Met He Gin Val Val Lys Trp Ala Lys Val Leu Pro Gly 
15 10 15 
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(54) Title: METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR COACTIVATOR BINDING 

(57) Abstract: The present invention relates to methods and agonist/antagonist compounds for modulating nuclear receptor coacti- 
vator binding. The invention includes a method for identifying residues comprising a coactivator binding site for a nuclear receptor 
of interest. Also included is a method of identifying agonists and/or antagonists that bind to a coactivator binding site of a nuclear 
receptor of interest. Agonists and antagonists of coactivator binding to nuclear receptors also are provided. The invention is exem- 
plified by identification and manipulation of the coactivator binding site of the thyroid receptor (TR), and compounds that bind to 
these sites. The methods can be applied to other nuclear receptors including RAR, RXR, PPAR, VDR, ER, GR, PR, MR, and AR. 
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METHODS AND COMPOUNDS FOR MODULATING NUCLEAR RECEPTOR 
1 0 COACTIVATOR BINDING 
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Nos. DK 51083, DK 51281, and P41-RR010S1 and from the Army of the United States: Grant No. 
AIBS#562. The U.S. Government may have rights in this invention. 

INTRODUCTION 

20 

Technical Field 

The present invention relates to methods and compounds for modulating nuclear receptor 
coactivator binding. 

Background 

25 Cells contain receptors that can elicit a biological response by binding various molecules 

including proteins, hormones and/or drugs. Nuclear receptors represent a super family of proteins 
that are hormone/ligand-activated transcription factors that enhance or repress transcription in a cell 
type-, ligand- and promoter-dependent manner. The nuclear receptor family includes receptors for 
glucocorticoids (GRs), androgens (ARs), mineralocorticoids (MRs) ? progestins (PRs), estrogens 

30 (ERs), thyroid hormones (TRs), vitamin D (VDRs), retinoids (RARs and RXRs), peroxisomes 
(XPARs and PPARs) and icosanoids (IRs). The so-called "orphan receptors" for which ligands 
have not been identified are also part of the nuclear receptor superfamily, as they are structurally 
homologous to the classic nuclear receptors, such as steroid and thyroid receptors. 
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5 Although overall sequence conservation between nuclear receptors varies between different families 
• of receptors, sequence conservation between functional regions, or modules, of the receptors is 
high. For example, nuclear receptors can be organized into functional modules comprising an N- 
terminal transcriptional activation domain, a central DNA binding domain (DBD), and a C-terminal 
ligand binding domain (LBD). The LBD of nuclear receptors represents a hormone/ligand- 

1 0 dependent molecular switch. Binding of hormone to a nuclear receptor's LBD changes its ability to 
modulate transcription of DNA, although they may have transcription-independent actions. Nuclear 
receptors also bind proteins, such as chaperone complexes, compressors, or coactivators, that are 
involved in receptor function. Hormone binding by a nuclear receptor can increase or decrease 
binding affinity to these proteins, and can influence or mediate the multiple actions of the nuclear 

1 5 receptors on transcription. For example, nuclear receptors can stimulate transcription in response to 
hormone binding by recruiting coactivator proteins to promoters of responsive genes (Glass et al., 
Curr. Opin. Cell Biol. (1997) 9:222-32); andHorwitz et al., Mol. Endocrinol (1996) 70:1167-77). 

Coactivators of the pi 60 family mediate activity of a transcriptional activation domain, 
called AF2, that is part of the nuclear receptor's LBD. A few receptor mutants deficient in 
20 coactivator-dependent activation have been isolated (TR: Coilingwood et al. Proc. Natl. Acad. Sci. 
(1997) 94:248-253; VDR: Jurutka et al., J. Biol Chem. (1997) 227:14592-14599, Masayama et al., 
Mol. Endocrinol. (1997) 11: 1507-1 517; ER and RAR: Henttu et al., Mol. Cell Biol (1997) 1 7: 1832- 
1839). While these studies support the physiological relevance of the observed interaction, the 
structural and functional nature of the site to which coactivators bind has not been defined. 

25 The medical importance of nuclear receptors is significant. They have been implicated in 

breast cancer, prostate cancer, cardiac arrhythmia, infertility, osteoporosis, hyperthyroidism, 
hypercholesterolemia, obesity and other conditions. However, limited treatments are available and 
current agonist/antagonist drugs used to target nuclear receptors are ligands that bind to the 
receptor's LBD buried deep within the receptor. Although additional targets on nuclear receptors 

30 are desired for drug development, the structural and functional basis of such sites, including the 
coactivator binding site, has not been described. 

Accordingly, a need exists for identification and characterization of the coactivator binding 
sites of nuclear receptors, and molecules that affect their interaction with cellular coactivator 
proteins. This would provide a major new target for iterative drug design, synthesis, and selection. 
35 It also would be advantageous to devise methods and compositions for reducing the time required to 
discover compounds that target the coactivator binding site of nuclear receptors and administer 
them to organisms to modulate physiological processes regulated by nuclear receptors. 

2 
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10 



15 



20 



25 



30 



Wagner et al., (Nature (1995) 375:690-697) disclose the crystal structure of rat TR-alpha 
LBD. Various references disclose mutations in carboxyl-terminal helices of nuclear receptors 
(Henttu et al., si^ra; O'Dormell et al., Mol. Endocrinol. (1991) 5:94-99; Whitfield et al Mol. 
Endocrinol (1995) 9:1 166-79; Saatcioglu et al., Mol. Cell Biol (1997) 77:4687-95; Collingwood et 
al., suera; Kamei et al., Cell (1996) 55:403-14). Hong et al. (Proc. Natl. Acad Sci. USA (1996) 
i>5(10):498-49452) and Hong et al. (Mol. Cell. Biol. (1997) 77:2735-2744) disclose cloning and 
expression of GRIP1 coactivator. Torchia et al., (Nature (1997) 557:677-84), Le Douarin et al 
(EMBOJ(l996) 75:6701-6715) and Heery et al. (Nature (1997) 557:733-736) disclose sequence 
alignment of various coactivator proteins showing a (SEQ ID NO: 1) LxxLL motif. 

SUMMARY O F THE TTWFAJTTmvr 

The present invention relates to identification and manipulation of the coactivator binding 
site of nuclear receptors. Identification of this site permits design and obtention of compounds that 
buid to the coactivator binding site of nuclear receptors and modulate coactivator binding to the 
receptor. The compounds include agonists and antagonists that modulate nuclear receptor activity 
by promoting (agonists) or blocking (antagonists) hormone-dependent coactivator binding to the 
receptor, particularly antagonists. The compounds of the invention can be receptor-, cell- and/or 
tissue-specific. 

The present invention also includes protein cocrystals of nuclear receptors with a molecule 
bound to the coactivator binding site and methods for making them. The cocrystals provide means 
to obtam atomic modeling information of the specific amino acids and their atoms forming the 
coactivator binding site and that interact with molecules that bind to the site, such as coactivator 
The cocrystals also provide modeling information regarding the coactivator:nuclear receptor 
interaction, as well as the structure of coactivators bound thereto. 

The present invention further provides methods for identifying and designing small 
molecules that bind to the coactivator binding site using atomic models of nuclear receptors. The 
method involves modeling test compounds that fit spacially into a nuclear receptor coactivator 
binding site of interest using an atomic structural model comprising a nuclear receptor coactivator 
binding site or portion thereof, screening the test compounds in a biological assay characterized by 
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5 binding of a test compound to a nuclear receptor coactivator binding site, and identifying a test 
compound that modulates coactivator binding to the nuclear receptor. 

The invention also includes compositions and methods for identifying coactivator binding 
sites of nuclear receptors. The methods involve examining the surface of a nuclear receptor of 
interest to identify residues that modulate coactivator binding. The residues can be identified by 
10 homology to the coactivator binding site of human TR described herein. Overlays and 
superpositioning with a three dimensional model of a nuclear receptor LBD, or a portion thereof 
that contains a coactivator binding site, also can be used for this purpose. Additionally, alignment 
and/or modeling can be used as a guide for the placement of mutations on the LBD surface to 
characterize the nature of the site in the context of a cell. 

15 Also provided is a method of modulating the activity of a nuclear receptor. The method can 

be in vitro or in vivo. The method comprises administering, in vitro or in vivo f a sufficient amount 
of a compound that binds to the coactivator binding site. Preferred compounds bind to the site with 
greater affinity than coactivator proteins found in a cell of interest. Binding at this site, the 
compound can compete for binding of coactivator proteins, thereby inhibiting gene transcription, or 

20 in some cases promoting it, eVen when hormone is or is not bound. 

The invention fiirther includes a method for identifying an agonist or antagonist of 
coactivator binding to a nuclear receptor. The method comprises providing the atomic coordinates 
comprising a nuclear receptor coactivator binding site or portion thereof to a computerized 
modeling system; modeling compounds which fit spacially into the nuclear receptor coactivator 
25 binding site; and identifying in an assay for nuclear receptor activity a compound that increases or 
decreases activity of the nuclear receptor through binding the coactivator binding site. 

Also provided is a machine-readable data storage medium with information for constructing 
and manipulating an atomic model comprising a coactivator binding site or portion thereof. The 
medium comprises a data storage material encoded with machine readable data which, when using a 
30 machine programmed with instructions for using said data, is capable of displaying a graphical 
three-dimensional representation of a molecule or molecular complex for a nuclear receptor 
coactivator binding site. 

Also provided is a method of identifying a compound that selectively modulates the activity 
of one type of nuclear receptor compared to other nuclear receptors. The method is exemplified by 
35 modeling test compounds that fit spacially and preferentially into a nuclear receptor coactivator 

4 
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binding site of interest using an atomic structural model of a nuclear receptor coactivator binding 
site, selecting a compound that interacts with one or more residues of the coactivator binding site 
unique in the context of that site, and identifying in an assay for coactivator binding activity a 
compound that selectively binds to the coactivator binding site compared to other nuclear receptors. 
The unique features involved in receptor-selective coactivator binding can be identified by 
comparing atomic models of different receptors or isoforms of the same type of receptor. 

The invention finds use in the selection and characterization of peptide, peptidomimetic, as 
well as other small molecule compounds, such as small organic molecules, identified by the 
methods of the invention, particularly new lead compounds useful in treating nuclear receptor-based 
disorders. 



BRIEF DESrRTPTT QN OF TTTF DRAWINGS 

Figure 1 shows the specific effects of mutations on hTR.pi transcriptional activation in 
HeLa cells and correlation with effects on binding to GST-GRIP1. T 3 dependent activation of 
transcription of a reporter gene, expressed as the percentage of WT is plotted for each mutant 
GST-GRIP1 binding, analyzed by autoradiography after separation using 10% SDS-PAGE, was 
also expressed as the percentage of WT and plotted for each mutant. The GST-GRIP1 used 
included GRIP1 amino acids 721-1 121; the same results were obtained using a GST-GRIP 1 
construct including GRIP1 amino acids 563-1 121 (data not shown). 

Figure 2 shows that overexpression of full-length GRIP1 rescues loss of transcriptional 
25 activation by hTRpl mutants. Indicated amounts of the expression vector for full-length GRIP1, 
PSG5-GRIP1, is included in the cotransfections, which otherwise are performed as in Figure 1. ' 
The WT or different representative hTRpi mutants are indicated. 

Figure 3 shows specific hERa surface mutants cause loss of transcriptional activation in 
HeLa cells in parallel with their loss of m vitro GRIP1 binding. The fold E 2 activation, expressed as 
30 the percentage of WT, and the phosphorimager quantitation of in vitro binding of [ 35 S]-labeled 
hERa WT and mutants to GST-GRIP1 (GRIP1 amino acids 721-1 121) also expressed as the 
percentage of WT is plotted for each mutant. 
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5 Figure 4 shows a plot of the fold E2 activation observed when the indicated amounts of the 

full-length GRIP1 expression vector, pSG5-GRIPl, are added to the co-transfection experiment, 
which otherwise is performed as for Figure 3. The WT or different hERa mutants are indicated. 
The data represent the averages of three independent experiments, with standard deviations less 
than 10%. 

10 Figure 5 shows a CPK model of the TRot-LBD, indicating the LBD surface locations of 

mutations made in the full-length hTRpl . Mutated residues having no effect on GRIP1 binding or 
effect on activation in HeLa cells are shaded gray. Mutated residues with diminished GRIP1 and 
SRC- la binding and diminished activation in HeLa cells are colored to reflect chemical properties 
of the residues: red, blue (purple), and green indicate acidic, basic, and hydrophobic residues, 

15 respectively. The main chain structures of the TRcx- and TRp-LBDs are the same (data not shown). 

Figure 6 shows sequence alignment of amino acid residues of members of the pi 60 
coactivator family. Single amino acid designations are used. Members of the pi 60 coactivator 
family interact with the nuclear receptors through conserved (SEQ ED NO: 1) LxxLL motifs. 

Figure 7 shows binding affinity assays of GST-GRIP 1 constructs withNR-boxes 1, 2, 
20 and/or 3 and their interaction with TR LBD. GRIP-1 NR boxes 1,2 and 3 interact differently with 
TRp LBD. Single letter designations are used for the amino acids. 

Figure 8 shows binding affinity assays of GST-GRIP 1 constructs withNR-boxes 1, 2, 
and/or 3 and their interaction with TR and GR LBDs. TR and GR differ in their interactions with 
GRIP-1. 

25 Figure 9 shows binding affinity assays for NR-box 2- and 3-peptides and GRIP1 and their 

interaction with TR LBD. NR box 2- and 3 -containing peptides reproduce the affinity and 
specificity of the NR interaction domain. 

Figure 10 shows binding affinity assays for NR-box 2- and 3-peptides and their interaction 
with TR LBD. Sequence adjacent to the (SEQ ID NO: 1) LxxLL motif modulate the affinity of 
30 NR-box-TRp LBD interactions. 
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Figure 11 shows binding affinity assays for mutant GRIP1 and NR-box 2- and 3-peptides 
and their interaction with TR LBD. The individual leucine residues of the (SEQ ID NO: 1) LxxLL 
motif are crucial for binding of the GRIP-1 NR interaction domain to TRp LBD. 



Figure 12 shows the contents of the asymmetric unit of the crystallized hTRB LBD:GRIP1 
NR-box 2 peptide complex. The crystal lattice consists of a repeating unit containing a 2:2 comple 

10 of hTR LBD and GRIP1 site 2 peptide. Positions of the two GRIP1 site 2 peptides are boxed, in 
green (sitel), and red (site 2), with the peptides drawn as a C-alpha trace. The two NCS related 
monomers of the hTR LBD are shown as a secondary structure ribbon drawing, with monomer 1 ir 
light grey, and monomer 2 in dark grey. The side chains of the hydrophobic residues 1689, L690, 
L693, L694 of the GRIP1 NR-box 2 peptides are drawn to emphasize those interactions observed i 

15 both bound peptides. 



Figure 13 shows a ribbon diagram depicting the interaction of the GRIPl NR-box 2 peptide • 
with the hTRB LBD. The GRIPl NR-box 2 peptide (dark grey) forms three turns of cc-helix, and 
binds the hTR LBD (light gray) in a hydrophobic cleft defined by helices H3, H4, H5, and H12. 
Portions of the hTRB LBD, and the neighboring monomer, are omitted for clarity. 

20 Figure 14 shows interface between the GRIPl NR-box 2 peptide and the hTRB LBD. Side 

chains of residues of the hTRB LBD within 4.5A of the GRIP-1 NR-box 2 peptide are labeled. The 
color of the individual side chains reflects the chemical nature of the residue: acidic residues are 
red, basic residue are blue, aliphatic residues are green, aromatic residues are brown, and polar 
residues are orange. The peptide is depicted as a C-alpha trace with the side chains of (SEQ ID 

25 NO: 2) ILxxLL motif shown explicitly. 

Figure 15 shows residues in the hTRB LBD that are necessary for transactivation. The 
transactivation mutations are mapped onto the interface between the GRIPl NR-box 2 peptide and 
the hTRB LBD. 



Figure 16 shows molecular surface of the hTR LBD. The side chains of the leucines resides 
30 fit within a hydrophobic groove formed from helices H3, H5, and HI 2, while the side chain of the 
non-conserved isoleucine residue packs against the outside edge of the groove. The remainder of 
the peptide is shown as main chain. 
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5 Figure 17 shows complementarity between the (SEQ ID NO: 1) LxxLL motif and the 

surface of the hTR LBD. The side chains of the (SEQ ID NO: 2) ILxxLL motif are shown in a CPK 
representation, with the main chain of the peptide drawn as a C-alpha trace. The three leucince 
residues fit into pockets of the coactivator binding site of the hTRB LBD, depicted as mesh, while 
the isoleucine residue rests on the edge of the site's cleft 

10 Figure 18 shows the coactivator binding site cleft, one side of which is formed by 

conformationally hormone-responsive residues. On the left is a view of the TR-LBD molecular 
surface showing the concave surfaces in gray. The cavity is shown at the center of the figure. On 
the right is shown a CPK model of the TR-LBD, overlaid with a molecular surface view, which is 
restricted to a 12A radius of the hydrophobic cavity. Mutated residues of the coactivator binding 

15 site that are hormone-insensitive (V284, K288, 1302 and K306) are located on one side of the cleft 
and are colored yellow. Mutated CBS residues likely undergo a conformational change upon 
hormone binding (L454 and E457) are located on the opposite side of the cleft and are colored red. 

Figure 19 shows alignment of amino acid sequences (single letter amino acid designations) 
containing residues that form the coactivator binding sites of several nuclear receptors. The boxes 
20 represent residues of alpha-helix (H3, H4, H5, H6 and H12); lower case letters "h" and "q" 
represent hydrophobic and polar residues, respectively. 
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5 DESCRIPTION OF SPECIFIC EMBODIMENTS 

The present invention provides methods and compositions for identifying compounds that 
modulate nuclear receptor activity. The compounds can be nuclear receptor agonists or antagonists 
that bind to the coactivator binding site (and that act as mimetics to the coactivator in this regard), 
and promote (agonists) or block (antagonists) binding of the coactivator to the target nuclear 
10 receptor. Compounds that bind to the coactivator binding site also are provided. The compounds 
can be natural or synthetic. Preferred compounds are small organic molecules, peptides and 
peptidomimetics (e.g., cyclic peptides, peptide analogs, or constrained peptides). 

As described in the Examples, mutagenesis and coactivator binding studies, coupled with 
analysis of atomic models derived from cocrystals, reveals for the first time a previously unknown 
structure for nuclear receptors, the coactivator binding site. By "coactivator binding site" is 
intended a structural segment or segments of nuclear receptor polypeptide chain folded in such a 
way so as to give the proper geometry and amino acid residue conformation for binding a 
coactivator. This is the physical arrangement of protein atoms in three-dimensional space forming a 
coactivator binding site pocket or cavity. Residues forming the site are amino acids corresponding 
to (i.e., the same as or equivalent to) human TR residues of C-terminal helix 3 (Ile280, Thr281, 
Val283, Val284, Ala287, and Lys288), helix 4 (Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306), 
helix 6 (Cys309), and helix 12 (Leu454, Glu457, Val458 and Phe459). The coactivator binding site 
is highly conserved among the nuclear receptor super family (Figure 19). Thus, this site 
corresponds to a surprisingly small cluster of residues on the surface of the LBD that form a 
prominent hydrophobic cleft. The hydrophobic cleft is formed by hydrophobic residues 
corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283, Val284, and Ala287), 
helix 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 (Leu454, Val458 and 
Phe459). The hydrophobic cleft of the coactivator binding site also is highly conserved among the 
nuclear receptor super family (Figure 19). 

The invention also includes compositions and methods for identifying coactivator binding 
sites of nuclear receptors. The methods involve examining the surface of a nuclear receptor of 
interest to identify residues that modulate coactivator binding. The residues can be identified by 
homology to the coactivator binding site of human TR described herein. A preferred method is 
alignment with the residues of any nuclear receptor corresponding to (i.e., equivalent to) human TR 
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residues of the C-terrninal helix 3 (116280, Thr281, Val283, Val284, Ala287, and Lys288), helix 4 
(Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306), helix 6 (Cys309), and helix 12 (Pro453, 
Leu454, Glu457, Val458 and Phe459). Overlays and superpositioning with a three-dimensional 
model of a nuclear receptor LBD, or a portion thereof that contains a coactivator binding site, also 
can be used for this purpose. For example, three-dimensional structures of TR, RAR, RXR and ER 
LBDs can be used for this purpose. For example, nuclear receptors identifiable by homology 
alignment include normal nuclear receptors or proteins structurally related to nuclear receptors 
found in humans, natural mutants of nuclear receptors found in humans, normal or mutant receptors 
found in animals, as well as non-mammalian organisms such as pests or infectious organisms, or 
viruses. 

15 Alignment and/or modeling also can be used as a guide for the placement of mutations on 

the LBD surface to characterize the nature of the site in the context of a cell. Selected residues are 
mutated to preserve global receptor structure and solubility. To destroy the coactivator binding 
interaction, preferred mutations are to charged residues (e.g., Arg, Lys, or Glu) on the basis that 
bulky, surface charged residues might disrupt coactivator binding, yet preserve global receptor 

20 structure and solubility. Mutants can be tested for coactivator binding as well as the relative change 
in strength of the binding interaction. Ligand-dependent coactivator interaction assays also can be 
tested for this purpose, such as those described herein. 

Compounds that bind to the coactivator binding site of nuclear receptors can be identified by 
computational modeling and/or screening. For example, coactivator agonists or antagonists can be 
25 identified by providing atomic coordinates comprising a nuclear receptor coactivator binding site or 
portion thereof to a computerized modeling system, modeling them, and identifying compounds that 
fit spacially into the coactivator binding site. By a "portion thereof is intended the atomic 
coordinates corresponding to a sufficient number of residues or their atoms of the coactivator 
binding site that interact with a compound capable of binding to the site. This includes receptor 

30 residues having an atom within 4.5A of a bound compound or fragment thereof. For instance, 
human TR residues V284, Phe293, Ile302, Leu305 and Leu454 contain side chain atoms that are 
within 4.5 A, and interact with, hydrophobic residues of a (SEQ ID NO: 1 ) LxxLL motif of an NR- 
box 2 coactivator peptide. As another example, an atomic structural model utilized for 
computational modeling and/or screening of compounds that bind to the coactivator binding site 

35 may include a portion of atomic coordinates of amino acid residues corresponding to the site 

composed of residues of human thyroid receptor selected from Val284, Lys288, Ile302, Lys306, 
Leu454 and Glu457, or their structural and functional equivalents found in other receptors. Thus, 
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for example, the atomic coordinates provided to the modeling system can contain atoms of the 
nuclear receptor LBD, part of the LBD such as atoms corresponding to the coactivator binding site 
or a subset of atoms useful in the modeling and design of compounds that bind to a coactivator 
binding site. 

The atomic coordinates of a compound that fits into the coactivator binding site also can be 
used for modeling to identify compounds or fragments that bind the site. By "modeling" is 
intended quantitative and qualitative analysis of molecular structure/function based on atomic 
structural information and receptor-coactivator agonists/antagonists interaction models. This 
includes conventional numeric-based molecular dynamic and energy minimization models, 
interactive computer graphic models, modified molecular mechanics models, distance geometry and 
other structure-based constraint models. Modeling is preferably performed using a computer and 
may be further optimized using known methods. By "fits spacially" is intended that the three- 
dimensional structure of a compound is accommodated geometrically by a cavity or pocket of a 
nuclear receptor coactivator binding site. 

Compounds of particular interest fit spacially and preferentially into the coactivator binding 
site. By "fits spacially and preferentially" is intended that a compound possesses a three- 
dimensional structure and conformation for selectively interacting with a nuclear receptor 
coactivator binding site. Compounds that fit spacially and preferentially into the coactivator 
binding site interact with amino acid residues forming the hydrophobic cleft of this site. In 
particular, the hydrophobic cleft of the coactivator binding site comprises a small cluster of 
hydrophobic residues. The site also contains polar or charged residues at its periphery. The present 
invention also includes a method for identifying a compound capable of selectively modulating 
coactivator binding to different nuclear receptors. The method comprises the steps of modeling test 
compounds that fit spacially and preferentially into the coactivator binding site of a nuclear receptor 
of interest using an atomic structural model of a nuclear receptor, screening the test compounds in a 
biological assay for nuclear receptor activity characterized by preferential binding of a test 
compound to the coactivator binding site of a nuclear receptor, and identifying a test compound that 
selectively modulates the activity of a nuclear receptor. Such receptor-specific compounds are 
selected that exploit differences between the coactivator binding sites of one type of receptor versus 
a second type of receptor, such as the differences depicted in Figure 19. 

The invention also is applicable to generating new compounds that distinguish nuclear 
receptor isoforms. This can facilitate generation of either tissue-specific or function-specific 
compounds. For instance, GR subfamily members have usually one receptor encoded by a single 
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5 gene, although there are exceptions. For example, there are two PR isoforms, A and B, translated 
from the same mRNA by alternate initiation from different AUG codons. There are two GR forms, 
one of which does not bind ligand. This method is especially applicable to the TR subfamily which 
usually has several receptors that are encoded by at least two (TR: a, p) or three (RAR, RXR, and 
PPAR: a, P, y) genes or have alternate RNA splicing. 

10 The receptor-specific compounds of the invention preferably interact with conformationally 

constrained residues of the coactivator binding site that are conserved among one type of receptor 
x compared to a second type of receptor. "Conformationally constrained" is intended to refer to the 
three-dimensional structure of a chemical or moiety thereof having certain rotations about its bonds 
fixed by various local geometric and physical-chemical constraints. Conformationally constrained 

15 structural features of a coactivator binding site include residues that have their natural flexible 
conformations fixed by various geometric and physical-chemical constraints, such as local 
backbone, local side chain, and topological constraints. These types of constraints are exploited to 
restrict positioning of atoms involved in receptor-coactivator recognition and binding. 

For instance, comparison of sequences of the GR and TR coactivator interaction surface 
20 shows a highly negatively charged sequence at the C-terminal end of TR helix 12 (E460 and D461) 
that is neutral in the equivalent positions in GR helix 1 2 (GR residues T788 and N759, 
corresponding to TR residue positions 460 and 461, as depicted in Figure 19). As described in the 
Examples, the cocrystal of the hTRB LBD complexed with the GRIP1 NR-box 2 peptide shows that 
TR residues E460 and D461 interact with positively charged residues of the NR-box 2 peptide. 
25 Also, when comparing the RAR LBD structure to that of the TR LBD, conformation of helix 12 
differs slightly, whereas helices 3, 4, 5 and 6 are substantially the same. Thus, differences in helix 
12, particularly charge differences at the C-terminal end of the helix, may modulate preferential 
interaction of TR for NR-box 2 containing coactivators. As further demonstrated in the Examples, 
TR and GR differ in their specificity for different NR-boxes containing the conserved (SEQ ID NO: 
30 1) LxxLL motif found in members of the pl60 family of coactivator proteins. As also demonstrated 
in the Examples, GR but not TR is able to interact with peptides containing the hydrophobic 
interaction motifs of p53 (SEQ ID NO: 3; FxxLW) and VP16 (SEQ ID NO: 4; FxxAL). Thus, TR 
exhibits preferential interaction with NR-box peptides comprising the (SEQ ID NO: 1) LxxLL 
motif, but GR does not discriminate and can bind peptides containing a generic amphipathic helix 
35 motif. Accordingly, these real differences among the various nuclear receptors can be exploited in 
the identification and design of compounds that modulate coactivator binding to one nuclear 
receptor compared to another. 
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For modeling, docking algorithms and computer programs that employ them can be used to 
identify compounds that fit into the coactivator binding site. For example, docking programs can be 
used to predict how a small molecule of interest can interact with the nuclear receptor coactivator 
binding site. Fragment-based docking also can be used in building molecules de novo inside the 
coactivator binding site, by placing chemical fragments that complement the site to optimize 
intermolecular interactions. The techniques can be used to optimize the geometry of the binding 
interactions. This design approach has been made possible by identification of the coactivator 
binding site structure thus, the principles of molecular recognition can now be used to design a 
compound which is complementary to the structure of this site. Compounds fitting the coactivator 
binding site serve as a starting point for an iterative design, synthesis and test cycle in which new 
compounds are selected and optimized for desired properties including affinity, efficacy, and 
selectivity. For example, the compounds can be subjected to addition modification, such as 
replacement and/or addition of R-group substituents of a core structure identified for a particular 
class of binding compounds, modeling and/or activity screening if desired, and then subjected to 
additional rounds of testing. 

Computationally small molecule databases can be screened for chemical entities or 
compounds that can bind in whole, or in part, to a nuclear receptor coactivator binding site of 
interest. In this screening, the quality of fit of such entities or compounds to the binding site may be 
judged either by shape complementarity (DesJalais et al., J. Med. Chem. (1988) 57:722-729) or by 
estimated interaction energy (Meng et al., J. Comp. Chem. (1992) 73:505-524). The molecule 
databases include any virtual or physical database, such as electronic and physical compound 
library databases, and are preferably used in developing compounds that modulate coactivator 
binding. 

Compounds can be designed intelligently by exploiting available structural and functional 
information by gaining an understanding of the quantitative structure-activity relationship (QSAR), 
using that understanding to design new compound libraries, particularly focused libraries having 
chemical diversity of one or more particular groups of a core structure, and incorporating any 
structural data into that iterative design process. For example, one skilled in the art may use one of 
several methods to screen chemical entities or fragments for their ability to associate with the 
coactivator binding site of a nuclear receptor of interest. This process may begin by visual 
inspection of, for example, the coactivator binding site on the computer screen. Selected fragments 
or chemical entities may then be positioned into all or part of the site. Docking may be 
accomplished using software such as Quanta and Sybyl, followed by energy minimization and 
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5 molecular dynamics with standard molecular mechanics force-fields, such as CHARMM and 
AMBER. 

For example, compounds and/or fragments can be designed to fill up the hydrophobic cleft, 
the pocket deep within the cleft, the upper end of the site, and/or the lower end of the site. Residues 
comprising a coactivator binding site, when defined by the user as those residues having an atom 

10 within 4.5 A of an atom of a bound chemical entity, can be modeled to look for energetic 
contributions and interaction with the bound chemical entity. For example, a compound or 
fragment can be designed to contain hydrophobic groups that interact with hydrophobic residues of 
the coactivator binding site. As described in the examples, human TR residues V284, Phe293, 
Ile302, Leu305 and Leu454 contain side chain atoms that are within 4.5A, and interact with, 

1 5 hydrophobic residues of a (SEQ ID NO: 1) LxxLL motif of an NR-box 2 coactivator peptide. Thus, 
for example, peptides and/or peptide mimetics having a hxxhh motif, where "h" is a hydrophobic 
residue and x is any residue, can be constructed. Small organic molecules that mimic one or more 
of these particular interactions also can be designed, for example, by including one or more R- 
groups that are hydrophobic and fit into the site. 

20 Specialized computer programs may also assist in the process of selecting chemical entity 

fragments or whole compounds. These include: GRID (Goodford, J. Med. Chem. (1985) 25:849- 
857; available from Oxford University, Oxford, UK); MCSS (Miranker et al., Proteins: Structure, 
Function and Genetics, (1991) 77:29-34; available from Molecular Simulations, Burlington, MA); 
AUTODOCK (Goodsell et al., Proteins: Structure, Function and Genetics (1990) 5:195-202; 

25 available from Scripps Research Institute, La Jolla, CA); and DOCK (Kuntz et al, J. Mol BioL 
(1982) 161 :269-288; available from University of California, San Francisco, CA). 

Additional commercially available computer databases for small molecular compounds 
include Cambridge Structural Database and Fine Chemical Database (Rusinko, Chem. Des. Auto. 
News (1993) 5:44-47). 

30 Once suitable chemical entities or fragments have been selected, they can be assembled into 

a single compound. Assembly may be proceeded by visual inspection of the relationship of the 
fragments to each other on the three-dimensional image displayed on a computer screen in relation 
to the structure coordinates of a nuclear receptor. This can be followed by manual model building 
using software such as Quanta or Sybyl. 
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Useful programs to aid one of skill in the art in connecting the individual chemical entities 
or fragments include: CAVEAT (Bartlett et al., "CAVEAT: A Program to Facilitate the Structure- 
Derived Design of Biologically Active Molecules", In: Molecular Recognition in Chemical and 
Biological Problems", Special Pub., Royal Chem. Soc. (1989) 78: 182-196; CAVEAT is available 
from the University of California, Berkeley, CA); 3D Database systems such as MACCS-3D (MDL 
Information Systems, San Leandro, CA; reviewed in Martin, J. Med. Chem. (1992) 35:2145-2154); 
and HOOK (available from Molecular Simulations, Burlington, MA). 

In addition to building a compound in a step-wise fashion, one fragment or chemical entity 
at a time as described above, compounds that bind to a coactivator binding site of interest also may 
be designed as a whole or de novo using either an empty coactivator binding site or optionally 
including some portions) of a molecule known to binds to the site, such as an NR-box type peptide. 
These methods include: LUDI (Bohm, J. Comp. Aid. Molec. Design (1992) 5:61-78; LUDI is 
available from Biosym Technologies, San Diego, CA); LEGEND (Nisbibata et al., Tetrahedron 
(1991) 47:8985; LEGEND is available from Molecular Simulations, Burlington, MA); and 
LeapFrog (available from Tripos Associates, St. Louis, MO). 

Other molecular modeling techniques may also be employed in accordance with this 
invention. See, for example, Cohen et al., J. Med. Chem. (1990) 33:883-894); Navia et al., Curr 
Opin. Struct. Biol. (1992) 2:202-210). For example, where the structures of test compounds are 
known, a model of the test compound may be superimposed over the model of the structure of the 
invention. Numerous methods and techniques are known in the art for performing this step, any of 
which may be used. See, for example, Farmer, -Drug Design," Ariens, E.J., ed. 5 10:119-143 
(Academic Press, New York, 1980); U.S. Patent No. 5,331,573; U.S. Patent No. 5,500,807; 
Verlinde, Structure, (1994) 2:577-587); and Kuntz et al., Science, (1992) 257:1078-1082). The 
model building techniques and computer evaluation systems described herein are not a limitation on 
the present invention. 

Using these computer modeling systems a large number of compounds may be quickly and 
easily examined and expensive and lengthy biochemical testing avoided. Moreover, the need for 
actual synthesis of many compounds can be substantially reduced and/or effectively eliminated. 

Compounds identified through modeling can be screened in an assay characterized by 
binding of the compound to a coactivator binding site of interest for coactivator binding activity, 
such as a biologically based assay. Screening can be in vitro and/or in vivo. Preferred assays 
include cell-free competition assays and cell culture based assays. The biological screening 
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preferably centers on activity-based response models, binding assays (which measure how well a 
compound binds to the receptor), and bacterial, yeast and animal cell lines (which measure the 
biological effect of a compound in a cell). The assays can be automated for high capacity - high 
throughput screening (HTS) in which large numbers of compounds can be tested to identify 
compounds with the desired activity. 

As an example, in vifro binding assays can be performed in which compounds are tested for 
their ability to block the binding of a coactivator protein, fragment, fusion or peptide thereof, to a 
coactivator binding site of interest. For cell and tissue culture assays, they may be performed to 
assess a compound's ability to block function of cellular coactivators, such as members of the pi 60 
family of coactivator proteins, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues 
TIF 2 and NcoA-2, and those that exhibit receptor and/or isoform-specific binding affinity. In a 
preferred embodiment, compounds of the invention bind to a nuclear receptor coactivator binding 
site with greater affinity than the cellular coactivator proteins. Tissue profiling and appropriate 
animal models also can be used to select compounds. Different cell types and tissues also can be 
used for these biological screening assays. Suitable assays for such screening are described herein 
and in Shibata et al. {Recent Prog. Horm. Res. 52/141-164 (1997)); Tagami et al. (Mol Cell Biol 
(1997) 77(5):2642-2648); Zhu et al. (J. Biol Chem. (1997) 272(1 4):9048-9054); Lin et al. (Mol 
Cell Biol (1997) i7(10):6131-6138); Kakizawa et al. (J. Biol Chem. (1997) 272(3S):23799-23804); 
and Chang et al. (Proa Natl Acad Set USA (1997) P4(17):9040-9045), which references are 
incorporated herein in their entirety by reference. For example, coactivators or binding fragments 
thereof can be expressed and/or assayed for binding as for GRIP1 (Hong et al., MCB supra : and 
Hong et al., PNAS suvrd) and/or SRC-1 (Spencer et al., Nature (1997) 389: 194-198; Onate et al., 
Science (1995) 270:1354-1357), incorporated by reference. 

The compounds selected can have agonist and/or antagonistic properties. The compounds 
also include those that exhibit new properties with varying mixtures of agonist and antagonist 
activities, depending on the effects of altering coactivator binding in the context of different 
activities of nuclear receptors, either hormone-dependent or hormone-independent, which are 
mediated by proteins other than coactivators, and which interact with the receptors at locations 
other than the coactivator binding site. The compounds also include those, which through their 
binding to receptor locations that are conformationally sensitive to hormone binding, have allosteric 
effects on the receptor by stabilizing or destabilizing the hormone-bound conformation of the 
receptor, or by directly inducing the same, similar, or different conformational changes induced in 
the receptor by the binding of hormone. 
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Of particular interest is use of such compounds in a method of modulating nuclear receptor 
activity in a mammal by administering to a mammal in need thereof a sufficient amount of a 
compound that fits spatially and preferentially into a coactivator binding site of a nuclear receptor 
of interest. By "modulating" is intended increasing or decreasing activity of a nuclear receptor. For 
example, pre-clinical candidate compounds can be tested in appropriate animal models in order to 
measure efficacy, absorption, pharmacokinetics and toxicity following standard techniques known 
in the art. Compounds exhibiting desired properties are then tested in clinical trials for use in 
treatment of various nuclear receptor-based disorders. These include ER-based disorders, such as 
postmenopausal symptoms and cancer resulting from loss of estrogen production, and osteoporosis 
and cardiovascular disease stemming from traditional estrogen replacement therapy. Others include 
TR-based disorders including cardiovascular disease, metabolic disorders, hyperthyroidism, 
glaucoma and skin disorders. GR-based disorders include Type II diabetes and inflammatory 
conditions such as rheumatic diseases. 

The invention also provides for cocrystals made from nuclear receptor ligand binding 
domains with a molecule bound to the coactivator binding site. As exemplified in the Examples, 
TR LBDs are co-crystallized with a peptide molecule comprising a coactivator NR-box 2 peptide 
sequence bound to the coactivator binding site, and the hormone/ligand T 3 . 

Crystals are made from purified nuclear receptor LBDs that are usually expressed by a cell 
culture, such as E. coli. Preferably, different crystals (cocrystals) for the same nuclear receptor are 
separately made using different coactivators-type molecules, such as protein fragments, fusions or 
25 small peptides. The coactivator-type molecules preferably contain NR-box sequences necessary for 
binding to the coactivator binding site, or derivatives of NR-box sequences. Other molecules can 
be used in co-crystallization, such as small organics that bind to the coactivator or hormone binding 
site(s). Heavy atom substitutions can be included in the LBD and/or a co-crystallizing molecule. 

After the three dimensional structure of the cocrystal is determined, the structural 
30 information can be used in computational methods to design synthetic compounds for the nuclear 
receptor, and further structure-activity relationships can be determined through routine testing using 
the assays described herein and known in the art. 

Since nuclear receptor LBDs may crystallize in more than one crystal form, the structure 
coordinates of such receptors or portions thereof, as provided in Appendix 1, are particularly useful 
35 for solving the structure of those other crystal forms of nuclear receptors. They may also be used to 
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5 solve the structure of mutants or co-complexes of nuclear receptors having sufficient structural 
similarity. 

One method that may be employed for this purpose is molecular replacement. In this 
method, the unknown crystal structure, may be determined using the structure coordinates of this 
invention as provided in Appendix 1, This method will provide an accurate structural form for the 
10 unknown crystal more quickly and efficiently than attempting to determine such information ab 
initio. 

Atomic coordinate information gleaned from the crystals of the invention can be stored. In a 
preferred embodiment, the information is provided in the form of a machine-readable data storage 
medium. This medium contains information for constructing and/or manipulating an atomic model 

15 of a coactivator binding site or portion thereof. For example, the machine readable data for the 
coactivator binding site comprises structure coordinates of amino acids corresponding to human TR 
amino acids selected from C-terminal helix 3 (Ile280, Thr281 3 Val283, Val284, Ala287 5 and 
Lys288), helix 4 (Phe293), helix 5 (Gln301, Ue302, Leu305, Lys306), helix 6 (Cys309), and helix 
12 (Pro453, Leu454, Glu457, Val458 and Phe459), or a honiologue of the molecule or molecular 

20 complex comprising the site. The homologues comprise a coactivator binding site that has a root 
mean square deviation from the backbone atoms of the amino acids of not more than 1 .5 A. A 
preferred molecule or complex represents a compound bound to the coactivator binding site. 

The machine-readable data storage medium can be used for interative drug design and 
molecular replacement studies. For example, a data storage material is encoded with a first set of 

25 machine-readable data that can be combined with a second set of machine-readable data. For 
molecular replacement, the first set of data can comprise a Fourier transform of at least a portion of 
the structural coordinates of the nuclear receptor or portion thereof of interest, and the second data 
set comprises an X-ray diffraction pattern of the molecule or molecular complex of interest Using 
a machine programmed with instructions for using the first and second data sets a portion or all of 

30 the structure coordinates corresponding to the second data can be determined- 
Protein for crystals and assays described herein can be produced using expression and 
purification techniques described herein and known in the art. For example, high level expression 
of nuclear receptor LBDs can be obtained in suitable expression hosts such as E. coli. Expression 
of LBDs in R. coli, for example, includes the TR LBD and other nuclear receptors, including 

35 members of the steroid/thyroid receptor superfamily, such as the receptors ER, AR, MR, PR, RAR, 
RXR and VDR. Yeast and other eukaryotic expression systems can be used with nuclear receptors 
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5 that bind heat shock proteins as these nuclear receptors are generally more difficult to express in 
bacteria, with the exception of ER, which can be expressed in bacteria. Representative nuclear 
receptors or their ligand binding domains have been cloned and sequenced: human RAR-ct, human 
RAR-Y, human RXR-a, human RXR-a, human PPAR-oc, human PPAR-(3, human PPAR-y, human 
VDR, human ER (as described in Seielstad et ai 9 Molecular Endocrinol, (1995) P:647-658, 
10 incorporated herein by reference), human GR, human PR, human MR, and human AR. The LBD 
for each of these receptors has been identified. 

Coactivator proteins can be expressed using techniques known in the art, particularly 
members of the pi 60 family of coactivator proteins that have been cloned and/or expressed 
previously, such as SRC-1, AIB1, RAC3, p/CIP, and GRIP1 and its homologues TIF 2 and NcoA-2. 
15 A preferred method for expression of coactivator protein is to express a fragment that retains 
transcriptional activation activity using the "yeast 2-hybrid" method as described by Hong et al. 
(PNAS supra; and MCB supra), for GRIP1 expression, which reference is herein incorporated by 
reference. 

The proteins can be expressed alone, as fragments of the mature or full-length sequence, or 
20 as fusions to heterologous sequences. For example, TR can be expressed without any portion of the 
DBD or amino-terminal domain. Portions of the DBD or amino-terminus can be included if further 
structural information with amino acids adjacent the LBD is desired. Generally, for the TR the 
LBD used for crystals will be less than 300 amino.acids in length. Preferably, the TR LBD will be 
at least 150 amino acids in length, more preferably at least 200 amino acids in length, and most 
25 preferably at least 250 amino acids in length. For example the LBD used for crystallization can 
comprise amino acids spanning from Met 122 to Val 410 of the rat TR-a or Glu 202 to Asp 461 of 
the human TR-p. 

Typically the LBDs are purified to homogeneity for crystallization. Purity of LBDs can be 
measured with sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), mass 
30 spectrometry (MS) and hydrophobic high performance liquid chromatography (HPLC) The 
purified LBD for crystallization should be at least 97.5 % pure, preferably at least 99.0% pure, more 
preferably at least 99.5% pure. 

Initially, purification of the unliganded receptor can be obtained by conventional techniques, 
such as hydrophobic interaction chromatography (HPLC), ion exchange chromatography (HPLC), 
35 and heparin affinity chromatography. 
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5 To achieve higher purification for improved crystals of nuclear receptors, especially the TR 

subfamily and TR, the receptors can be ligand-shift-purified using a column that separates the 
receptor according to charge, such as an ion exchange or hydrophobic interaction column, and then 
bind the eluted receptor with a ligand, especially an agonist. The ligand induces a change in the 
receptor's surface charge such that when re-chromatographed on the same column, ligand-bound 
10 receptor is separated from unliganded receptor. Usually saturating concentrations of ligand are used 
in the column and the protein can be preincubated with the ligand prior to passing it over the 
column. The structural studies detailed herein indicate the general applicability of this technique 
for obtaining super-pure nuclear receptor LBDs for crystallization. 

Purification can also be accomplished by use of a purification handle or "tag," such as with 
15 at least one histidine amino acid engineered to reside on the end of the protein, such as on the N- 
terminus, and then using a nickel or cobalt chelation column for purification. (Janknecht et al., 
Proc. Natl Acad Set USA, (1991) 55:8972-8976) incorporated by reference. 

Typically purified LBD, such as TR LBD, is equilibrated at a saturating concentration of 
ligand at a temperature that preserves the integrity of the protein. Ligand equilibration can be 

20 established between 2 and 37°C, although the receptor tends to be more stable in the 2-20°C range. 
Preferably crystals are made with the hanging drop methods detailed herein. Regulated temperature 
control is desirable to improve crystal stability and quality. Temperatures between 4 and 25 °C are 
generally used and it is often preferable to test crystallization over a range of temperatures. The 
crystals are then subjected to vapor diffusion and bombarded with x-rays to obtain x-ray diffraction 

25 pattern following standard procedures. 

For co-crystallization with a peptide that binds to the coactivator binding site, various 
concentrations of peptides containing a sequence that binds to a coactivator binding site of a nuclear 
receptor of interest can be used in microcrystallization trials, and the appropriate peptides selected 
for further crystallization. Any number of techniques, including those assays described herein can 

30 assay peptides for binding to the coactivator binding site of a nuclear receptor of interest. In a 
preferred embodiment, a NR-box 2 sequence-containing peptide is used for crystallization with TR 
LBD. A preferred peptide contains a NR-box (SEQ ID NO: 1) LxxLL motif, and suitable flanking 
sequences necessary for binding and forming complex with coactivator binding site of the nuclear 
receptor of interest, such as a TR LBD. The binding peptides are then tested in crystallization trials 

35 at various concentrations and ratios of concentrations with a nuclear receptor of interest, for 

example, as described herein and in the Examples. For crystallization trials with TR LBD, the 

hanging drop vapor diffusion method is preferred. Conditions of pH, solvent and solute 
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5 components and concentrations and temperature can be adjusted, for instance, as described in the 
Examples. In the handing drop method, to obtain suitable crystals for x-ray diffraction analysis, 
seeding of prepared drops with microcry stals of the complex can be used. Collection of structural 
information can be determined by molecular replacement using the structure of TR LBD 
determined herein or previously by Wagner et al M supra. The structure is refined following 

1 0 standard techniques known in the art. 

There are many uses and advantages provided by the present invention. For example, the 
methods and compositions described herein are useful for identifying peptides, peptidomimetics or 
small natural or synthetic organic molecules that modulate nuclear receptor activity. The 
compounds are useful in treating nuclear receptor-based disorders. Methods and compositions of 
15 the invention also find use in characterizing structure/function relationships of natural and synthetic 
coactivator compounds. 

The following examples illustrate various aspects of this invention. These examples do not 
limit the scope of this invention. 
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5 EXAMPLES 

Example 1: Expression and purification of wild-type and mutant nuclear receptors and 
coactivators 

A. Human TRS LBD 

Human TRB LBD (His6-E202-D461) was expressed and purified as described (Shiau et al.,^ 

10 Gene (1996) 1 79(2):205-10). Briefly, the protein was expressed from pET (e.g., pET3 and pET28) 
in BL21DE3 at 14°C, induced at OD(600nm) 0.7 with ImM IPTG and incubation was extended for 
24 hours. Cells were harvested and lysed in 50mM sodium-phosphate buffer (pH 8.0), 0.3M NaCl, 
10% glycerol, 25mM fi-merceptoethanol and 0.1 mM PMSF as described above. The lysate was 
cleared by ultracentrifugation (Ti45, 36000 rpm 3 lh, 4°C), loaded on a Talon column equilibrated in 

15 the sodium phosphate buffer described above, washed with 12mM imidazole and eluted with an 

imidazole gradient (12 - 300 mM). TRB LBD containing fractions were loaded in 0.6M ammonium 
sulfate on a TSK-phenyl hydrophobic interaction column and eluted with a reverse ammonium 
sulfate gradient [0.6 - 0 M] in 50% glycerol and 10% acetonitrile. Fractions containing TRB LBD 
were tested for hormone binding, pooled and incubated with a 3 -fold molar excess of T3 (Sigma). 

20 The hydrophobic interaction run was repeated with liganded receptor under the same conditions. 
Liganded receptor, which elutes earlier than unliganded receptor, was collected and buffer changed 
to 20mM Hepes pH7.0, 3mM DTT and 0. 1 T 3 using NAP columns (Pharmacia). For 
crystallization, the protein was concentrated by ultrafiltration (Millipore UFV2BGC10 
concentrators) to a final concentration of 9mg/ml. The yield was about 9.5mg protein per liter 

25 bacterial culture. 

B. Human TR mutants 

Thirty-seven thyroid receptor mutants were created by synthesizing double-stranded 
oligonucleotides which encode the mutant sequence and which have ends allowing them to be 
ligated as a cassette using pairs of the Nsil, PstI, SstI, AlwNI, Apol, PflMI, BstXI, BseRI, BsmFI, 
30 PvuII, Nspl, Smal, Pmll, Bglll and BsmI restriction sites of the hTR(Jl cDNA sequence, or the 3' 
plasmid poly linker Sail, or BamHI restriction sites. The hTRpl sequences thus mutated were 
subcloned into the pCMX vector encoding the full-length 461 amino acid hTRpl sequence. Some 
of the mutations of the hTRpl in the CMX vector and all three mutations of the hERct in the pSG5- 
ER-HEGO vector (Tora et aL, EMBO (1989) 5:1981) were created using Quick Change Site- 
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Directed Mutagenesis Kits (Stratagene). The mutated sequences were verified by DNA sequencing 
using Sequenase Kits (Stratagene). 

C. Human ERq LBD 

The human ERa-LBD 297-554 was overexpressed as described previously (Seielstad, et al., 
susra) in BL21(DE3)pLysS cells transformed with a modified pET-23d-ERG vector that contained 
the sequence Met-Asp-Pro fused to residues 297 to 554 of the hERa (provided by Paul Sigler of 
Yale University). Clarified bacterial lysates were adjusted to 3 M in urea and 0.7 M in NaCl and 
then applied to a 10-ml column of estradiol-Sepharose (Greene, et al., Proc. Natl. Acad. Sci. USA 
(1980) 77:5115-5119; Landel, etal., Mol. Endocrinol. (1994) 5:1407-1419; Landel, et al, J. Steroid 
Biocliem. Molec. Biol. (1997) 6"3:59-73). 

To carboxymethylate the solvent-accessible cysteines, the bound hERct-LBD was treated 
with 5 mM iodoacetic acid in 10 mM Tris, pH 8.1, 250 mM NaSCN (Hegy, et al., Steroids (1996) 
67:367-373). Protein was eluted with 3 x 10-5 M ligand (either DES or OHT) in 30-100 ml of 50 
mM Tris, 1 mM EDTA, 1 mM DTT and 250 mM NaSCN, pH 8.5. The yield of hERa-LBD was 
typically dose to 100% (Seielstad, et al., Biochemistry (1995) 34:12605-12615). The affinity- 
purified material was concentrated and exchanged into 20 mM Tris, 1 mM EDTA, 4 mM DTT, pH 
8.1 by ultrafiltration. The protein was bound to a Resource Q column (Pharmacia) and then eluted 
with a linear gradient of 25-350 mM NaCl in 20 mM Tris, pH 8.1, 1 mM DTT. The hERa-LBD- 
ligand complexes eluted at 150-200 mM NaCl. Pooled fractions were concentrated by 
ultrafiltration and analyzed by SDS-PAGE, native PAGE, and electrospray ionization mass 
spectrometry. 

D. Human ER mutants 

To test the importance of the NR box peptide/LBD interface observed in the crystal, a series 
of site-directed mutations were introduced into the ERoc LBD. These mutations were designed 
either to simultaneously perturb the structural integrity and the nonpolar character of the floor of the 
binding groove (lie 358->Arg, Val 376->Arg and Leu 539->Arg) or to prevent the formation of the 
capping interactions (Lys 362->Ala and Glu 542->Lys). Fusions of glutathione-S-transferase 
(GST) to the wild-type and mutant LBDs were analyzed for their ability to bind 3 5S-labeled GRIP1 
in the absence of ligand or in the presence of DES or OHT. 
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35 S-labeled GRJP1 was incubated with either immobilized GST, immobilized wild type 
GST-hERcc LBD, or immobilized mutant GST-LBDs in the absence of ligand or in the presence of 
DES or OHT. The bound GRIP1 was quantitated after SDS-PAGE. I358R, mutant LBD 
containing a Ile->Arg substitution at residue 358; K362A, mutant LBD containing a Lys->Ala 
substitution at residue 362; V376R, mutant LBD containing a Val->Arg substitution at residue 376; 
L539R, mutant LBD containing a Leu->Arg substitution at residue 539; E542K, mutant LBD 
containing a Glu->Lys substitution at residue 542. 

In the absence of ligand or in the presence of OHT, fusions to the wild-type protein and all 
of the mutant LBDs showed no detectable binding to GRIP1. The lie 358->Arg, Val 376->Arg and 
Leu 539->Arg mutants were all unable to interact with coactivator in the presence of agonist, 
confirming the importance of the packing interactions observed in the crystal. Disruption of either 
the N- or C-terminal capping interaction also compromised GRIP1 binding in the presence of 
agonist. Only the wild-type GST-LBD was able to recognize the coactivator in the presence of 
DES. 

E. Human ER LBD-GST Fusion Protein 

A fusion between glutathione-S-transferase (GST) and amino acids 282-595 of hERa was 
constructed by subcloning the EcoRI fragment from pSG5 ERa-LBD (Lopez et al., submitted 
manuscript) into pGEX-3X (Pharmacia). The He 358-> Arg, Lys 362->Ala, and Leu 539->Arg 
mutations were introduced into the GST-LBD construct using the QuikChange Kit (Stratagene) 
according to the manufacturer's instructions. The Val 376->Arg and Glu 542->Lys mutations were 
created in the GST-LBD construct by subcloning the Bsml/Hindlll fragments of derivatives of 
pSG5-ER-HEGO (Tora, et al., supra) into which these mutations had already been introduced. All 
constructs were verified by automated sequencing (University of Chicago Cancer Research Center 
DNA Sequencing Facility). 

F. Radiolabeled full-length receptors and coactivator proteins 

Wild-type (WT) or mutant pCMV-hTRpi vector and the pSG5-GRIPl and pCMX-SRC-la 
vectors were used to produce radiolabeled full-length receptors and coactivator proteins using the 
TNT coupled Reticulocyte Lysate System (Promega) and [ 35 S]-Met (DuPont). GST-GRIP 1 (amino 
acids 721-1221), GST-GRIP1 (amino acids 563-1121), GST-SRC- la (amino acids 381-882), GST- 
hTRpl (full-length, WT or mutants, WT provided by. C. Costa), and the GST-hRXRot (full-length 
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5 provided by. C. Costa), fusion proteins were produced in E. coli strain HB101 as per the 
manufacturer's protocol (Pharmacia Biotech). 

G. Coactivato r GRIP1 563-767 His6 GST fusion p rotein 

GRIP1 563-767 was cloned as a Bam Hl-Xho I fragment derived from pGEX-2TK GRIP1 
563-1121 into the corresponding sites of pGEX-4Tl. A His6-tag was added by inserting a Xho I- 
Nae I fragment of pET23a into Xho I-Bsa AI sites of this pGEX-4Tl construct yielding pGEX 
GRIP1 563-767His6. Mutants of GRIP1 563-767 were generated by PCR or single stranded 
mutagenesis using oligonucleotides carrying the mutations and a pSG5 GRIP1 vector as template. 
The mutations were confirmed by sequence analysis and integrated into pGEX GRIP1 563-767His6 
as NgoMI - Xho I fragments. The GRIP1 563-767 His6 GST fusion protein was expressed in 
15 HB101 at 37°C. Protein expression was induced with lmM IPTG at an optical density (600 nm) of 
0.7 and extended for 4 hours after induction. Cells were harvested by centrifugation, resuspended 
in sonication buffer (20mM TrisHCl pH 8.0, 0.1M NaCl, 10%glycerol, O.lmM PMSF and protease 
inhibitors (Complete, EDTA free, Boehringer Mannheim)). The resuspended cells were freeze- 
thawed once, incubated on ice with O.lmg/ml lysozyme for 20 minutes and lysed per sonication. 
20 The lysate was cleared by ultracentrifugation (Ti 45, 36000rpm, lh 4°C), the supernatant filtered 
(Costar 0.2um top filter) and loaded on a Talon column (Clontech). The column was washed with 
10 column volumes of sonication buffer supplemented with 12mM imidazole and eluted with an 
imidazole gradient [12 - lOOmM]. At this step the fusion proteins are about 95% pure. Imidazole 
was removed by gelfiltration on NAP columns (Pharmacia), and protein concentrations determined 
25 using the Biorad protein assay. Equal concentrations of the different derivatives of the fusion 
fragment were incubated with glutathione agarose (lh, 4°C) which was equilibrated in binding 
buffer (sonication buffer supplemented with lmM DTT, lmM EDTA and 0.01% NP-40). Beads 
were washed with at least 20 volumes of this buffer, diluted in binding buffer with 20% glycerol to 
40%, frozen in aliqots and stored at-70°C. 

30 H. Coactivator GRIP1 563-767 His6 

GRIP1 563-767 was cloned as a Bam HI - Xho I fragment derived fron pGEX GRIP1 563- 
767His6 into corresponding cloning sites of pET23a yielding pETGRTPl 563-767His6. The 
fragment was expressed in BL21DE3. Expression, cell lysis and Talon purification was identical as 
described for GST GRIP1 563-767His6. The protein eluted from a Talon column in two fractions, 
35 one at 12mM and one between 40 and 70mM imidazole. In the earlier eluting fraction the fragment 
was associated with a 70 kDa protein which was removed by a MonoQ run in 50mM TrisHCl 
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5 pH7.5, 10% glycerol, lmM EDTA, ImM DTT, O.lmM PMSF and protease inhibitors. GRIP1 563- 
767His6 eluted in the flow through and was concentrated by ultrafiltration. At this step the protein 
was more than 95% pure. 

Example 2: Peptide synthesis 

Coactivator peptides were obtained using standard techniques. All peptides were HPLC 
10 purified and analyzed by mass spectroscopy. Peptide concentrations were either determined 
spectroscopically using the tyrosine signal (A 2 ?6 - 1450 M-lcm-1) or by amino acid analysis 
following standard techniques. 

Example 3: Binding assays with nuclear receptors and coactivators 

A. GST-GRIP Pull-down Assays and Peptide Competition Assays 

15 Binding experiments were performed by mixing glutathione beads containing 10 |ag of GST 

fusion proteins (Coomassie Plus Protein Assay Reagent, Pierce) with 1-2 \x\ of the [ 35 S]-labeled 
wild-type or mutant hTRpi (25 fmoles, 4000 cpm of receptor), or coactivators in 150 jal of binding 
buffer (20 raM HEPES, 150 mM KC1, 25 mM MgCl 2 , 10% glycerol, 1 mM dithiothreitol, 0.2 mM 
phenylmethylsuifonyl fluoride, and protease inhibitors) containing 2 mg/ml BSA for 1.5 hrs in the 

20 presence or absence of 1 |uM T 3 . Beads were washed 3 times with 1 ml of binding buffer and the 
bound proteins were separated using 10% SDS-PAGE and visualized by autoradiography. Binding 
was quantitated by phosphorimaging using ImageQuant (Molecular Dynamics). 

For in vitro binding studies GR, TR and their derivatives were translated in the presence of 
[ 35 S]methionine using the TNT Coupled Reticulocyte System (Promega). Separate translations were 

25 performed in the presence and absence of 10pM dexamethasone or lpM RU486 for GR and 10|xM 
triiodothyronine for TR. Expression was quantified by phosphoimager analysis (BAS2000, Fuji). 
For all binding assays 50\xl of a 20% bead suspension containing either 1.6 or 4.0 pM bound 
purified GST GRIP1 fragment (either 568-767 or 563-1121) was incubated with 0.2jil or 1.4jjd in 
vitj-o transcribed and translated TR or GR, respectively. Binding was performed in the binding 

30 buffer described above supplemented with 20 jag/rnl BSA and appropriate hormone. The chosen 
GST GRIP1 fragment concentrations were sufficient to bind either 70 or 100% of the TR 
derivatives. The reaction was incubated at 4°C under rotation for 2 hours. In case of competition 
experiments, the appropriate concentration of peptides were added to the reaction before addition of 
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receptors. However, no differences in the results were noted by adding the peptides after half of the 
incubation of the GST GRIP1 fragment with nuclear receptors. This demonstrates that equilibrium 
is reached under the chosen conditions. Beads were washed five times with 200ul binding buffer + 
BSA at 4°C before elution of the bound proteins in 20ul SDS loading buffer. Eiuted beads and 
input labeled protein were subjected to SDS-PAGE. The fraction of bound nuclear receptors was 
determined by phosphoimager analysis. 



B - GST-hTRftl Pull-down Assay s 

Assay and analysis was performed as for Example 3 A. In vitro binding of [ 35 S]-labeled full- 
length GRIP1, [ 35 S]-labeled full-length SRC-la, and [ 35 S]-labeled full-length hRXRa, to GST- 
hTRpl wild-type (WT) and mutants was performed. Mutants V284R, K288A, I302R, L454R, and 
E457K all bound to hRXRoc with an affinity equivalent to wild type hTR. All of these mutants 
showed decreased ability to bind GRIP1 and SRC-la, as expected from the results of Example 3A. 
The same results were obtained when a GST-SRC1 construct including SRC-la amino acids 381- 
882 was tested for binding of [ 35 SJ-Met-labeled full-length hTRpl WT and mutants (data not 
shown). 

C GST-hERa LBD Pull-down Assays 

The wild-type and mutant GST-hERa LBDs were expressed in BL21(DE3) cells. Total 
ligand binding activity was determined by a controlled pore glass bead assay (Greene, et al., Mol. 
Endocrinol. (1988) 2:714-726) and protein levels were monitored by western blotting with a 
monoclonal antibody to hERa (H222). Cleared extracts containing the GST- hERa LBDs were 
incubated in buffer alone (50 mM Tris, pH 7.4, 150 mM NaCl, 2 mM EDTA, 1 mM DTT, 0.5% 
NP-40 and a protease inhibitor cocktail) or with 1 uM of either DES or OHT for 1 hour at 4°C. 
Extract samples containing thirty pmol of GST-LBD were then incubated with 10 ul glutathione- 
Sepharose-4B beads (Pharmacia) for 1 hour at 4°C. Beads were washed five times with 20 mM 
HEPES, pH 7.4, 400 mM NaCl, and 0.05% NP-40. 35 S-labeled GRIP1 was synthesized by in vitro 
transcription and translation using the TNT Coupled Reticulocyte Lysate System (Promega) 
according to the manufacturer's instructions and pSG5-GRIPl as the template. Immobilized GST- 
hERcc LBDs were incubated for 2.5 hours with 2.5 al aliquots of crude translation reaction mixture 
diluted in 300 ul of Tris-buffered saline (TBS). After five washes in TBS containing 0.05% NP-40, 



27 



.996001 4A3_IA> 



WO 99/060014 



PCT/US99/06899 



10 



5 proteins were eluted by boiling the beads for 10 minutes in sample buffer. Bound 35 S-GRIP1 was 
quantitated by fluorography following SDS-PAGE. 

D. Electrophoretic Mobility Shift Assays 

GRIP1, a mouse P 160 coactivator, recognizes the ERot LBD in a ligand-dependent maimer. 
The binding of agonists to the ERaLBD promotes recruitment of GRIP1, whereas binding of 
antagonists prevents this interaction (Norris, et al., J. Biol. Chem. (1998) 273:6679-88). While 
agonist-bound receptor will bind to all three of the NR boxes from GRIPI, ERa strongly prefers 
NR-box 2 (Ding, et al., Mol. Endocrinol. (1998) J2:302-13). 

An electrophoretic mobility shift assay was used to'directly assess the ability of the NR-box 
2 peptide to bind the purified ERa LBD in the presence of either DES or OHT. Eight microgram 
15 samples of purified hERcc-LBD bound to either DES or OHT were incubated in the absence of the 
peptide, i.e., buffer alone, or in the presence of either a 2-fold or 10-fold molar excess of the GRIPI 
NR-box 2 peptide. The binding reactions were performed on ice for 45 minutes in 10 ul of buffer 
containing 20mM Tris, pH 8.1, lmM DTT, and 200mM NaCl and then subjected to 6% native 
PAGE. Gels were stained with GELCODE Blue Stain reagent (Pierce). 

20 In the presence of the NR-box 2 peptide, the migration of the DES-hERa-LBD complex was 

retarded. In contrast, peptide addition had no effect on the mobility of the OHT-hERa-LBD 
complex. Hence, this peptide fragment of GRIPI possesses the ligand-dependent receptor binding 
activity characteristic of the full-length protein. 

Example: 4 Transfection assays with TR and hERa 

25 HeLa cell transfection and assay conditions are described (Webb et al., Mol Endocrinol 

(1995) 9:443). For TR assays, 5 ug of the reporter p(DR-4) 2 -TK-LUC consisting of two copies of 
the DR-4 element (a direct repeat of the consensus TR response element (TRE) spaced by 4 base 
pairs) placed upstream of a minimal (-32/+4S) thymidine kinase gene promoter linked to luciferase 
(LUC) coding sequences were used. A reporter containing palindromic TREs gave the same results 
(data not shown). Also, 2 ug of the hTRBl expression vector, pCMX-TR (WT or mutant), and 0.5 
ug transfection control vector, P J3LacZ, which contains the SV40 promoter linked to the £- 
galactosidase gene, were used. Other cells co-transfected with vector or receptor constructs can be 
used for same purpose. Alternative cells expressing sufficient levels of an endogenous receptees), 
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5 or cells selected that express a single reporter, can be used for transfection assays, including MCF-7 
cells expressing ER (Webb et al., supra) , and GC cells expressing TR (Norman et al., J. Biol 
Chem. (1989) 254:12063-12073). 

For hERcc assays, 5 |ug of estrogen responsive reporter plasmid encoding chloramphenicol 
acetyltransferase (CAT), pERE-colITATA (Sadovsky, et al, Mol Cell Biol (1995) 75: 1 554), 0.5 jxg 

10 expression vector encoding full-length hERtx , pSG5-er HEGO (WT or mutants), and 2 jxg of 
pj31acz, were used. For the experiments of Figures 2 and 4, 0.5 jag of a full-length GRIP1 
expression vector, pSG5-GRIPl, was also included in the transfection. Transfected cells were 
treated with or without 1 T 3 or E 2 , as indicated. After culturing for 24 hrs, the LUC or CAT 
activities were assayed and the (5-galactosidase activities were also assayed to correct for 

1 5 differences in transfection efficiencies. The triplicate points were averaged and standard deviations 
were less than 10%. 

Example 5: Hormone binding assays for wild-type and mutant TRs 

The T3 binding affinity constants (Kd) for in vitro -translated WT and mutant TRs were 
measured using [ 125 I] 3,5,3 ? triiodo-L-thyronine ([ 125 I]T 3 ) in gel filtration binding assays as 

20 described (Apriletti et al., Protein Expr. Parif. (1995) 5:363). Both the Kd and standard error (S.E.) 
values were calculated using the Prism computer program (GraphPad Software, Inc.). Mutations 
are indicated by the single-letter amino acid abbreviations, with the native residue name, followed 
by the primary sequence position number, and then the mutated residue name. The affinity of the 
WT TR is 81 ± 12 pM. The relative affinity was determined by dividing the WT Kd by each 

25 mutant Kd. The 37 mutants tested with their relative affinities are: E217R (123%), E227R (109%), 
K242E (92%), E267R (117%), H271R (123%), T277R (7%), T281R (145%), V284R (105%), 
D285A (89%), K288A (98%), C294K (94%), E295R (118%), C298A (87%), C298R (141%), 
E299A (171%), I302A (86%), I302R (99%), K306A (6%), K306E (6%), P384R (164%), A387R 
(107%), E390R (151%), E393R (146%), L400R (95%), H413R (109%), H416R (153%), M423R 

30 (156%), R429A (48%), S437R (170%), L440R (174%), V444R (89%), T448R (234%), E449R 
(36%), P453E (32%), L454R (26%), L456R (46%), E457K (71%). 

Example 6: Coactivator binding assays for wild-type and mutant TRs 

Wild type (WT) TR and most of the TR mutants liganded to 3,5,3'-triiodo-L-thyronine (T 3 ) 
bind equally well to the coactivator, GRIP1. In all cases, GRIP1 binding was hormone-dependent 
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5 (data not shown). Mutations L454R and E457K in surface residues of helix 12 abolish GRIP1 
binding (Figure 1). Mutations in two residues of helix 3, V284R and K288A, and two residues of 
helix 5, 1302R and K306A, also impair binding (Figure 1). Five mutations with dirninished GRIP1 
binding (V284R, K2S8A, I302R, L454R, and E457K) also show decreased binding to another 
coactivator, SRC-la (data not shown). Thus, these results show that two different coactivators 
1 0 recognize the same TR surface residues. 



Example 7: TR residues involved in ligand-dependent transcription activation in context of a 
cell 

Residues involved in ligand-mediated transcription activation were identified by testing the 
TR mutants of Example 8 in HeLa cells. T 3 increased reporter gene activity 5-fold in cells 
expressing either WT TR or mutated TRs showing normal GRIP1 binding (representative mutants 
are shown in Figure 1. By contrast, TR mutants with diminished or absent GRIP1 binding (V284R, 
K288A, I302R, K306A, L454R, and E457K) show a dirninished or absent response to T 3 which 
correlates with the GRIP1 binding defect Overexpression of GRIP1 increases activation by the 
WT TR and rescues activation by TR mutants roughly in proportion to the severity of the defect of 
GRIP1 binding and activation (Figure 2). These results suggest that the same residues are required 
for coactivator binding, function of the endogenous coactivator(s) in HeLa cells, and responsiveness 
of TRs to GRIP 1. 



Example 8: Effect of TR mutations on other receptor functions 

The effects of the mutations on other receptor functions also were examined . All of the 
25 mutants bound radiolabeled thyroid hormone (Kd values, 6%-234% that for native receptor); 
occasional lower values were expected because some residues have partially buried side chains. 
None of the residues that decrease GRIP1 binding affected TR binding to a GST-RXR fusion 
protein or to DNA using three different DNA half-site arrangements and testing with or without 
added RXR (data not shown). Some mutations that affect GRIP1 binding occur in a region 
30 spanning helices 3-5, which has been suggested as important for TR/RXR heterodimerization 
(O'Donnell et al., susm; Lee et al., Mol. Endocrinol (1992) 6: 1867- 1873). In contrast, however, the 
above results indicate that these residues do not contribute to TR/RXR heterodimerization. Further, 
TRs mutated in the CBS residues retain the ability of WT TR of T 3 -dependent inhibition of the 
activity of the Jun and Fos transcription factors at an AP-1 site (Saatcioglu et al., supra\ suggesting 
35 that the CBS residues do not participate in TR actions mediated through these proteins. These data 
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5 indicate that the mutational effects are specific, the amount of input labeled TR in the different 
reactions is comparable, and the levels of expression of the mutant TRs are comparable to those of 
WT receptors. 

Example 9: Coactivafor binding site in ER 

Three separate mutations (K362A, V376R, and E542K) were created in human estrogen 
10 receptor-a (hERa) which align to three of the effective positions in hTRpl (K288A, I302R, and 
E457K). All three mutations diminish GRIP1 binding and abolish transcriptional activation 
(Figure 3), and mutant V376R, with 10% residual GRIP1 binding, was rescued partially by 
overexpression of GRIP1 (Figure 4). As a control, the ER mutants demonstrated a normal 
hormone-dependent ability to activate a vitellogenin-LUC hybrid reporter gene, GL45, which 
15 responds to the ER amino-terminal activation function (Berry et ah, EMBO J (1990) 9:2811-2818) 
(data not shown). The finding that similar residues are required for GRIPl binding and 
transcription activation activity in the TR and ER suggests that the coactivator binding site residues 
are similar in different nuclear receptors. 

Example 10: Coactivator NR-box binding affinity for TR 

20 To study the interaction between nuclear receptors and GRIPl in vifro, a fragment of GRIPl 

(563-767) was purified that contains all three NR-boxes (Figures 6 and 7). The fragment was 
found to be highly soluble and, in agreement with a secondary structure prediction using PhD, 
displays a mainly alpha-helical far UV-CD spectrum (data not shown). Three of the four helices 
predicted for the fragment include the NR-boxes at their C-terminus., suggesting that these boxes are 

25 part of amphipathic alpha-helices. These results show that the NR-boxes of GRIPl are contained in 
a soluble, alpha-helical 24kD fragment. 

Binding assays show that GRIPl NR-boxes 1, 2 and 3 5 interact differentially with hTRB 
LBD (Figure 7). A GST-fusion of the GRIPl (563-767) fragment strongly binds TR (kD or EC50 ) 
in a ligand depend fashion. Replacement of the hydrophobic residues of NR-box 3 with alanine 
30 does not reduce binding of TR significantly, whereas similar replacement of NR-box 2 results in 
loss of TR binding of about 50%. By titrating the amount of GRIPl fragment, about a 4-fold 
stronger binding of TR for NR-box 2 (EC50 = 1.0 \xM) over NR-box 3 (EC50 = 4.0 \iM) was 
estimated. In the absence of functional NR-boxes 2 and 3, almost no binding to TR was detected 
suggesting that under these experimental conditions NR-box 1 is not a cognate binding site for TR. 
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Full length TR or TR-LBD bound GRJP1 equally. These results show that TR recognizes GRIP1 
NR-box 2 and 3, with preference for NR-box 2. 



Example 11: Coactivator NR-box binding affinity for GR 

GR also was found to bind GRIP1 (563-767) in a ligand-dependent manner (Figure 8). 
However, in contrast to TR, extension of GRIP1 (563-767) to residue 1121 increases binding to GR 

10 about 3-fold suggesting an additional binding site on GRIP1 for GR. Binding of the larger fragment 
remains ligand-dependent; no interaction can be observed in the presence of the GR partial 
antagonist RU486. These results are in agreement with in vivo 2-hybrid GR GRIP1 interaction 
studies. In the presence of ligand no difference was detected in the binding of GRIP 1 by full length 
GR or a deletion mutant of GR that lacks the N-terminal activation domain AF-1. However in the 

15 absence of ligand, binding of GR to GRIP1 (563-1 121) increased by about 10-fold indicating that 
sequences in the GR N-terminus are able to suppress binding of unliganded GR to this additional 
binding site in GRIP1. Additionally, GR did not bind to a GRIP1 (563-767) mutant in which both 
NR-box 2 and 3 are replaced by alanines, and binds most strongly to a fragment that lacks a 
functional NR-box 2. As with TR, GR does not recognize NR-box 1. In contrast to TR, the GR 

20 prefers NR-box 3 to NR-box 2. These results demonstrate that GR prefers binding to NR-box 3 and 
interacts with an additional GRIP1 site within the CREB (cAMP - response - element binding 
protein) - binding protein (CBP) binding site. 

Example 12: Coactivator peptide binding affinity for TR 

To investigate whether the preference of TR for NR-box 2 is dependent on the sequence or 
25 structural context of the NR-boxes, competition studies on the interaction of GRIP1 with hTRB 
LBD were performed using coactivator peptides containing different NR- boxes (NR-box 2 peptide 
(residues 11-23 of SEQ ID NO: 6) EKHKILHRLLQDS, and NR-box 3 peptide (residues 9-21 of 
SEQ ID NO: 7) ENALLRYLLDKDD) (Figure 9). Consistent with the interaction of hTR LBDS 
with GRIP1 (563-767) NR-box mutants, a peptide containing NR-box 1 competes the interaction of 
30 GRIP1 with hTRB LBD only at very high concentrations (EC50 = 130 uM). Peptides containing 
either NR-box 2 or 3 compete GRIP1 (563-767) efficiently and display the preference of hTRB 
LBD for NR-box 2 (EC50 (NR-box 2) = 1.5 uM, EC50 (NR-box 3) = 4 uM). The apparent 
affinities (EC50) for peptides of NR-box 2 and 3 are comparable with the analogous GRIP1 (563- 
767) NR-box mutants suggesting that the preference of TR for NR-boxes is solely dependent on the 
35 sequence and independent of tie structural context of the NR-boxes. 
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5 Peptides of NR-box 2 or 3 compete GRIP1 (563-767) containing functional NR-boxes 2 and 

3 or a mutant of this fragment that contains only a functional NR-box 2 with comparable affinity. 
Thus, while TR can bind both NR-box 2 and 3, in a GRIP1 coactivator peptide fragment contai n i n g 
both boxes, TR preferentially binds NR-box 2. 

These results show the preference of TR for NR-box 2 is sequence dependent. 

10 The same types of assays for TR competition are performed to assess coactivator peptide 

binding affinity for GR. The peptide concentrations are normalized relative to TR for obtaining 
comparable dose response curves. 

Example 13: Binding affinity of TR for extended coactivator peptides 

Sequence identity between all three central NR-boxes of the pi 60 coactivator family is 
15 limited to the conserved leucine residues of the (SEQ ID NO: 1) LxxLL motif (Figure 6). 
However, the sequence conservation of a particular NR-box can extend into neighboring residues. 
To investigate the contribution of these neighboring residues to affinity and specificity of the 
different NR-boxes for TR, the ability of peptides containing individual NR-boxes with different 
^ lengths of adjacent sequences to compete with the interaction of GRJP1 (563-767) with hTRB LBD 
20 were compared (Figure 10). 

A peptide consisting of the minimal motif of NR-box 3 (residues 12-17 of SEQ ID NO: 7; 
LLRYLL) does not compete the TR LBD interaction with GRIP1 (563-767). A peptide consisting 
of the NR-box 2 (residues 15-20 of SEQ ID NO: 6; ILHRLL) also does not sufficiently compete the 
interaction (data not shown). Extending peptides containing a (SEQ ID NO: 1) LxxLL motif to 

25 include adjacent residues increased affinity for both NR-box motifs and magnified the preference of 
TR for NR-box 2 (NR-box 2 peptides: (residues 11-23 SEQ ID NO: 6) EKHKILHRLLQDS and 
(residues 7-23 of SEQ ID NO: 6) TSLKEKHKILHRLLQDS; and NR-box 3 peptides: (residues 8- 
24 of SEQ ID NO: 7) KENALLRYLLDKDDTKD and (residues 5-24 of SEQ ID NO: 7) 
PKXKENALLRYLLDKDDTKD). A chimeric peptide containing the NR-box 3 motif in the 

30 context of the NR-box 2 flanking sequences (SEQ ID NO: 31; TSLKEKHKLLRYLLQDSS) binds 
like a NR-box 2 peptide. 

These results demonstrates that preference of TR for NR-box 2 is at least partially due to 
features of the bound peptide (residues 15-20 of SEQ ID NO: 6; ILHRLL), but that their affinity 
and specificity is modulated by adjacent sequences. 
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5 Example 14: Binding affinity of TR and GR for mutant coactivator 



A. TR affinity for ILxxLL motif residues 

To investigate the role of the hydrophobic residues in NR-box 2, individual residues of the 
(residues 15-20 of SEQ ID NO: 6) ILHRLL motif were replaced by alanine in the background of 
GRIP1 (563-767) containing a non-functional NR-box 3 (Figure 11). Surprisingly, replacement of 
10 any of the conserved leucines prevents binding to TR almost completely. Only replacement of the 
nonconserved isoleucine exhibited a lessened but still severe impact on the affinity of NR-box 2 for 
TR. As replacement of a single leucine by alanine is sufficient to overcome the interaction of both 
the remaining hydrophobic residues and adjacent sequences with hTRB LBD, it appears that their 
contribution to the affinity of NR-box 2 for hTRB LBD is cooperative rather than additive. 

15 Similar results were obtained by competing the interaction of hTRB LBD with the GRIP1 

(563-767) NR-box 3 mutant using peptides in which either IL, HR or LL of the NR-box 2 motif are 
replaced by alanines (Figure 11). Whereas the peptides containing the IL or LL replacement failed 
to interact with the hTRB LBD even at very high concentrations, in agreement with a proposed 
alpha-helical structure of the motif, replacement of the "HR spacer" by alanines showed a marginal 

20 effect on the affinity of the peptide for TR-LBD. 

Replacement of single leucine residues of NR-box 2 by phenylalanine reduced the affinity 
of NR-box 2 peptides for TR LBD about 100-fold, replacement of the isoleucine about 10-fold 
(Figure 11). Therefore, the interaction of TR with GRIP1 relies not simply on the hydrophobicity 
of the (SEQ ID NO: 1) LxxLL motif, but also on positive contributions by the leucine residues 
25 themselves. 

These results demonstrate that single mutations of the conserved leucines in the (SEQ ID 
NO: 1) LxxLL motif strongly reduce affinity of GRIP 1 for hTRB LBD. 

Collectively, the above examples demonstrate that peptides containing NR-boxes, 
particularly NR-box 2, reproduce the affinity and specificity of the interaction of GRIP 1 (563-767) 
30 with hTRB LBD. 

B. TR affinity of FxxLW and FxxAL motif residues 

The three conserved leucines of the NR-box 2 (SEQ ID NO: 2) ILxxLL motif are embedded 
in the hydrophobic cleft of the hTRB LBD:NR-box 2 interaction surface, whereas the non conserved 
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5 isoleucine is located on the rim of this cleft where structural changes can be more easily 
accommodated (See Example 18). In agreement with this structure, replacement of this residue by 
alanine or phenylalanine reduced binding to hTRB LBD to a less extent than the comparable 
mutations of the conserved leucine residues. The surface generated by the three conserved leucines 
(L690, L693, L694) of the NR-box 2 peptide (residues 12-24 of SEQ ID NO: 6) 686- 

10 KHKILHRLLQDSS-698 is highly complementary to the corresponding binding site in the hTRB 
LBD (Figures 16 and 17). Comparison of this binding site to other nuclear receptors shows that it 
contains a structural motif that is unique, highly conserved and present in all known structures of 
nuclear receptor LBDs (Wurtz et al., Nat Struct Biol (1996) 3:87-94; Wagner et al. 5 supra : Renaud 
et ah, Nature (1995) 575:681-689; Bourguet et al., Nature (1995) 375:377-382; and Brzozowski et 

15 al., Nature (1997) 3SP:753-758). 

Interaction of highly conserved hydrophobic motifs, which are part of amphipathic alpha- 
helices, with complementary hydrophobic surfaces resembles a feature observed for the interaction 
of several other transcriptional activators with their target proteins (p53:MDM2, VP16:TAFII31 or 
CREB.KIX-CBP). However, the motifs of p53 (FxxLW), VP 16 (FxxAL) and CREB (YxxIL) 

20 differ from the (SEQ ID NO: 1) LxxLL motif of nuclear receptor coactivators. A Fxxxh motif may 
be generally involved in interaction with TAFII31, where "h" represents any hydrophobic residue. 
Though with respect to the known structures, complementarity of the interacting hydrophobic 
surfaces identified here seem to be a common feature of these interactions, cross-reactions between 
different motifs are possible. For instance, VP16, p53, and p65 (FxxFL) are able to functionally 

25 interact with TAFII31, or p53 and E2F1-DP1 (FxxLL) both interact with MDM2. These 
interactions are sensitive to mutations in the Fxxxh motif. Therefore it appears that either 
complementarity of the hydrophobic surfaces is not an absolute requirement or that induced fitting 
of the interacting surfaces is possible. 

Based on these observations, studies were performed to determine whether GRIP1 interacts 
30 with TAFII31 or MDM2. However, no interaction was detected. GRIP1 mutants changing NR-box " 
2 ( SEQ ID NO: 1; LxxLL) to VP16 ( SEQ ID NO: 4; FxxAL) or p53 (SEQ ID NO: 3; FxxLW) like 
binding sites also failed to bind TAFII3 1 or MDM2 demonstrating that the presence of the correct 
binding site is not sufficient to create binding (data not shown). Moreover, peptides containing the 
VP 16 or p53 binding sites are not able to compete the interaction of GRIP1 with TR, even in very 
35 high concentration, but do compete the interaction with GR (data not shown). The affinity of this 
interaction is weak, but comparable to affinity of a peptide of NR-box 2 that, in the context of a 
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5 GRIP1 mutant lacking NR-box 3, binds GR in vivo (Ding et al., supraY This binding is only about 
ten times less than a peptide containing NR-box 3, GR's primary binding site. 

As shown above, GR binds GRJP1 (563-767) with about one-fifth the affinity than a 
comparable amount of TR. Thus, the high concentration of NR-box 3 peptide required to compete 
the interaction of GR with GRIP1 (563-767) may rather reflect a weak affinity of GR for the 
peptide rather than a particular strong interaction of GR with GRIP1 (563-767). 



10 



These results suggest that at least on the peptide level, other hydrophobic motifs besides 
(SEQ ID NO: 1) LxxLL can interact with the coactivator binding site, but that it is receptor 
dependent. 

C TR affinity for residues adjacent to ILxxLL motif 

1 5 Peptides containing a FxxLL motif bind TR but with two orders of magnitude lower affinity 

than a (SEQ ID NO: 1) LxxLL motif (Figure 11). To test whether the additional changes in the 
hydrophobic motif or adjacent sequences of the VP16 peptide prevent its binding to TR, a chimeric 
peptide containing the NR box-2 motif (SEQ ID NO: 1) LxxLL in the context of the VP16 sequence 
was constructed. This peptide binds to TR but with an about 100-fold lower affinity than the 

20 original NR-box 2 peptide. Thus, the inability to bind the VP 16 peptide appears to be due to the 
combination of an imperfect hydrophobic motif and the incompatibility of TR to adjacent sequences 
of the VP 16 motif. 

As the interaction of the chimeric peptide with GR was comparable to the original NR-box 2 
and VP 16 peptides, this incompatibility appears due to TR-specific features in the NR-box 
25 interaction surface. These results show sequences adjacent the NR-box motif LxxLL can reduce 
binding of NR-box 2 to TR, but not GR. 



Example 15: Crystallization and Structure Determination of NR LBD Complexes 

A. Crystallizat ion of hTRB LBD with T 2 and GRIPl NR-box 2 Vrp tM* 

Several peptides containing GRIPl NR-box 2 were tested in crystallization trials with the 

hTRB LBD. The complex of the hTRB LBD with the GRIPl NR-box 2 peptide 686- 

KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) produced crystals that were dependent 

on both the presence and the concentration of the peptide. Without the peptide, the hTRB LBD 

precipitated immediately. However, nucleation was erratic, but could be overcome through seeding 
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5 . of prepared drops with, microcrystals of the hTRfi LBDiGRIPl NR-box 2" /peptide complex. 
Structure of ; the hTRfi LBD:GRIP1 NR-box 2 peptide complex was determined by molecular 
replacement using the structure of the hTRB LBD determined previously (Wagner et al. 3 supra\ and 
reiined to a resolution of 3.i5A (Table 1). The refined model consists of residues K211-P254 and 
V264-D461 of monomer 1 of the hTRfi LBD, residues K211-P254 and G261-D461 of monomer 2 

10 of the hTRB LBD, and the GRIP1 NR-box 2 peptides (residues 14-24 of SEQ ID NO: 6) 688- 
KILHRLLQDSS-698, and (residues 14-22 of SEQ ID NO: 6) 688-KILHRLLQD-696 (Appendix 

* 1). 

Briefly, the complex between the hTRB LBD and the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) was prepared by mixing (equal) 

15 volumes of a solution of 9mg/ml hTRfi LBD in 20mM HEPES pH 7.4 with a solution of 14 mM 
GRIP1 in 0.4mM ammonium acetate pH 4.72, and incubating the mixture on ice for 1 hour. 
Crystals were obtained after 2 days at 4°C using hanging drop vapor diffusion from a drop 
containing 1.5 pi of hTRfi LBD: GRIP 1 complex, prepared as described, and 0.5pJ 15%PEG 4K, 
0.2M sodium citrate pH 4.9, suspended above a reservoir containing 10% PEG 4K, 0.1M 

20 ammonium acetate, and 0.05 M sodium citrate (pH 5.6). After allowing the drop to equilibrate for 1 
hour, 0.2pl of 10-3 to 10-5 dilutions of microcrystals in reservoir buffer were introduced to provide 
nucleation. Crystals are of space group P3121 (a=95.2, b=95.2, c=137.6) and contain two 
molecules of the hTRB LBE> and two molecules of the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6). 
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Table 1 



Data collection, phasing, and refinement statistics 



Data collection 


Data set 


Resolution 
(A) 


Reflections 


Coverage (%) 




Native 


3.6 


measured 


unique 


96.3 


0.007 


35565 


8490 




Rotation search 


Search model 


Euler angles (°) 


Correlation coefficient 


©i 


0 2 


e 3 


Highest peak 


Highest false peak 


hTR P LBD 


Ml 


60.12 


80.68 


241.90 


16.3 




M2 


9.93 


87.70 


180.6 


15.9 


14.2 




Translation search 




Fractional coordinates 


Translation function 


X 


y 


z 


Highest peak (o) 


Highest false peak (o) 




Ml 


0.522 


0.428 


0.250 


19.52 


10.02 


M2 


0.200 


0.932 


0.119 


26.11 


5.77 




Refinement 




Resolution 
(A) 


Reflection 


R 


Rfree 


F>2( 


25 - 3.7 


7614 


0.2990 


0.3219 


All data 


■ 25 - 3.7 


7851 


0.3010 


0.317 





Rsym- 2 h Si | u (7 h ( | / 2/ h for the intensity (I) of / observations of reflection h 
Correlation coefficient = Z h Eo 2 Ec 2 - Eo 2 Ec 2 / [S h (Eo 2 - Eo 2 f 2 h (Ec^Ec 2 ) 2 ]** 2 
Translation function (t a , t b , .) = S h (\E 0Qi f - 2 h <\Eo {h) f>) (Ec (h &KJ 2 " <l^(h)| 2 ) 
where E Q represents the normalized observed structure factor amplitudes, and 
Ec represents the normalized structure factors for the search model in a 
triclinic unit cell with dimensions identical to that of the crystal. The 
reported peak height represents the value of the function for the translation 
(t a , t b ) of the NCS monomers, divided by the rms value of the translation 
function density. 
Rfactor = 2|F obs .F calc |/2|F 0bs |. 

Rfree is calculated the same as R factor, except only for 10% of the reflections that 
were set aside for cross validation and not used in refinement. 
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5 B. Crystallization of hERcc LBP with PES and GRIP1 NR-box 2 Peptide 

Crystals of a DES-hERa LBD-GRIP1 NR-box 2 peptide complex were obtained by hanging 
drop vapor diffusion. Prior to crystallization, the DES-hERa LBD (residues 297-554) complex was 
incubated with a 2-4 fold molar excess of the GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS- 
698 (residues 12-24 of SEQ ID NO: 6) for 7-16 hr. Two *xL samples of this solution were mixed 

10 with equal volume samples of reservoir buffer consisting of 25-27% (w/v) PEG 4000, 90 mM Tris 
(pH 8.75-9.0) and 180 mM Na Acetate and suspended over wells containing 800 pL of the reservoir 
buffer. After 4-7 days at 19-21°C, rod-like crystals were obtained. The coactivator complex 
crystals lie in the spacegroup P2i with cell dimensions a=54.09, b=82.22, c=58.04 and [3=111.34. 
Two molecules each of the DES-LBD and the coactivator peptide form the asymmetric unit A 200 

1 5 fim x 40 jam x 40 (im crystal was transferred to a cryosolvent solution containing 25% (w/v) PEG 
4000, 10% (w/v) ethylene glycol, 100 mM Tris (pH 8.5), 200 mM Na Acetate and 10 ]xM peptide 
and frozen in an N2 stream at -170°C in a rayon loop. Diffraction data from this crystal were 
measured at -170°C using a 300 mm MAR image plate at the Stanford Synchrotron Radiation 
Laboratory (SSRL) at beamline 7-1 at a wavelength of 1.08 A. The diffraction images were 

20 processed with DENZO and scaled with SCALEPACK (Otwinowski, et al., Methods EnzymoL 
(1997) 276:307-326) using the default -3a cutoff. 

C. Crystallization of hERa LBP with OHT 

Crystals of the hERa LBD (residues 297-554) complexed to OHT were obtained by the 
hanging drop vapor diffusion method. Equal volume aliquots (2 \xL) of a solution containing 3.9 

25 mg/mL protein-ligand complex and the reservoir solution containing 9% (w/v) PEG 8000, 6% (w/v) 
ethylene glycol, 50 mM HEPES (pH 6.7) and 200 mM NaCl were mixed and suspended over 800 
^iL of the reservoir solution. Hexagonal plate-like crystals formed after 4-7 days at 21-23°C. Both 
crystal size and quality were improved through microseeding techniques. These crystals belong to 
the space group P6 5 22 with cell parameters a=b=58.24 A and c=277.47 A. The asymmetric unit 

30 consists of a single hERa LBD monomer; the dimer axis lies along a crystallographic two-fold. A 
single crystal (400 urn x 250 fim x 40 fim) was briefly incubated in a cryoprotectant solution 
consisting of 10% (w/v) PEG 8000, 25% (w/v) ethylene glycol, 50 mM HEPES (pH 7.0) and 200 
mM NaCl and then flash frozen in liquid N2 suspended in a rayon loop. Diffraction data were 
measured at -170°C using a 345 mm MAR image plate at SSRL at beamline 9-1 and at a 
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wavelength of 0.98 A. The diffraction images were processed with DENZO and scaled with 
SCALEPACK (Otwinowski, et al., supra ) using the default -3a cutoff. 



Example 16: Structure determination and refinement of NRLBD complexes 

A - Structure of hTRB LED with T, and GRIP! NR-box 2 Pep tide 

Data were measured using Cu Ka radiation from an R-axis generator at 50 kV and 300 mA 

10 with a 0.3mM collimator and a Ni filter. Reflections were measured using an R-Axis II detector 
and integrated with Denzo, and equivalent reflections scaled using Scalepack (Otwinowski and 
Minor, "Processing of x-ray diffraction data collected in oscillation mode." In Macromolecular 
Crystallography, Part A (ed. C.W. Carter, Jr. and R.M. Sweet), pp. 307-326. Academic Press, New 
York, NY). Possible rotation function solutions were calculated using normalized amplitudes in 

15 AMORE from a model of hTRB LBD with the ligand, T 3 , omitted; translation function solutions 
were subsequently determined using TFFC for the two rotation solutions with the highest 
correlation coefficients. For two hTRB LBD molecules in the asymmetric unit, the calculated 
solvent content is 52%. After rigid body refinement of the two hTRB LBD molecules, electron 
density maps were calculated. Strong positive density present in both the anomalous and 

20 conventional difference Fourier maps for the iodine atoms of the T 3 ligand confirmed the 
correctness of the solution. The iodine atoms for both T 3 ligands were modeled as a rigid body, and 
the structure refined with strict NCS symmetry using CNS. Both 2FoFc and FoFc electron density 
maps showed interpretable density, related by the NCS operator, near HI 2 of both molecules of the 
hTRB LBD. The electron density could be modeled as a short ct-helix, and the observed side chain 

25 density was used to tentatively assign the sequence and direction to the chain. The refined model 
consists of residues of the hTRB LBD, and peptide residues of the GRIP1 NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6). 

Atomic coordinates of the hTRB LBD.GRP1 site 2 peptide complex are attached as 
Appendix 1. 



30 B. 



Structure of hERoc LB D with PES and GRIP! NR-box 2 Ppp f.V^ 



Initial efforts to determine the structure of the DES-hERcc LBD-NR box 2 peptide 686- 
KHKELHRLLQD S S-698 (residues 12-24 of SEQ ID NO: 6) complex utilized a low resolution (3.1 
A) data set (data not shown). A self-rotation search implemented with POLARRFN ("The CCP4 
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5 suite: programs for protein crystallography", Acta Crystallogr. (1994) D50:760-763) indicated the 
presence of a noncrystallographic dyad. The two LBDs in the asymmetric were located by 
molecular replacement in AMoRe (CCP4, 1994) using a partial polyalanine model of the human 
RARy LBD (Renaud, et aL, supra ) as the search probe (R=58.2%, CC=35.6% after placement of 
both monomers). Given that the model at this point was both inaccurate (r.m.s.d. 1.7 A between 

10 this model and the final model based on Ca positions) and incomplete (accounting for only -45% 
of the total scattering matter in the asymmetric unit), an aggressive density modification protocol 
was undertaken. Iterative cycles of two-fold NCS averaging in DM (CCP4, 1994) interspersed with 
model building in MOLOC (Muller, et aL, Bull Soc. Chinu Belg. (1988) 97:655-667) and model 
refinement in REFMAC (Murshudov, et aL, Acta Crystallogr. (1997) D53:240-255) (using tight 

15 NCS restraints) were used to quickly build a model of the LBD alone. For this procedure, MAMA 
(Kleywegt, et aL, "Halloween... masks and bones. In From First Map to Final Model", Bailey, et al, 
eds., Warrington, England, SERC Daresbury Laboratory, 1994) was used for all mask 
manipulations and PHASES (Furey, et al., PA33 Am. Cryst Assoc. Mtg Abstr. (1990) 18:73) and 
the CCP4 suite (CCP4, 1994) were used for the generation of structure factors and the calculation of 

20 weights. 

However, although the DES-hERoc LBD-NR complex model accounted for -90% of the 
scattering matter in the asymmetric unit, refinement was being hampered by severe model bias. 
The high-resolution data set of the DES-hERoc LBD-NR-box 2 peptide complex became available 
when the R& e e of the OHT-hERa LBD model was -3 1%. Both monomers in the asymmetric unit of 

25 the DES complex crystal were relocated using AMoRe and the incompletely refined OHT-hERa 
LBD model (with helix 12 and the loop between helices 11 and 12 removed) as the search model. 
The missing parts of the model were built and the rest of the model was corrected using MOLOC 
and two-fold averaged maps generated in DM. Initially, refinement was carried out with REFMAC 
using tight NCS restraints. At later stages, the model was refined without NCS restraints using the 

30 simulated annealing, minimization and B-factor refinement protocols in X-PLOR and a maximum- 
likelihood target. All B-factors were refined isotropically and anisotropic scaling and a bulk solvent 
correction were used. The Rf rec set contained a random sample of 6.5% of all data. In refinement, 
all data between 27 and 2.03 A (with no cr cutoff) were used. The final model was composed of 
residues 305-549 of monomer A, residues 305-461 and 470-554 of monomer B, residues 687-697 of 

35 peptide A, residues 686-696 of peptide B, 164 waters, two carboxymethyl groups and a chloride 
ion. According to PROCHECK, 93.7% of all residues in the model were in the core regions of the 
Ramachandran plot and none were in the disallowed regions. Thus, the structure of the DES-hERoc 
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LBD-NR-box 2 peptide complex has been refined to a crystallographic R-factor of 19.9% 
(Rfree^S.O^o) using data to 2.03 A resolution. 

lie 689 from the peptide interacts with three receptor residues (Asp 538, Glu 542 and Leu 
539). The y-carboxylate of Glu 542 forms hydrogen bonds to the amides of residues 689 and 690 of 
the peptide. A water-mediated hydrogen bond network is formed between the imidazole ring of His 
377, the y-carboxylate of Glu 380, and the amide of Tyr 537. Three residues (Glu 380, Leu 536 and 
Tyr 537) interact with each other through van der Waals contacts and/or hydrogen bonds. 
Intriguingly, mutations in each these three residues dramatically increase the transcription activity 
of unliganded ERot LBD (Eng, et aL, Mol Cell Biol (1997) 17:4644-4653); Lazennec, et al., Mol 
Endocrinol (1997) 11:1375-86; White, et aL, EMBOJ. (1997) 16:1427-35), Atomic coordinates of 
DES-LBD-peptide complex are attached as Appendix 2. 
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5 Table 2 

Summary of Crystallographic Statistics 



Ligand 





Data Collection 


DES 


OHT 




Space group 


P2i 


P6522 


10 


Resolution 


2.03 


1.90 




Observations 


104189 


269253 




Unique 


30265 


23064 




Completeness (%) 


98.4 


99.1 




Rsym(%) a 


7.8 


7.0 


15 


Average I/cri 


9.8 " 


16.1 




Refinement 








Number of non-hydrogen atoms 


4180 


2070 




Rcyst (%) b /R free (%) 


19.9/25.0 


23.0/26. 


20 


Bond r.m.s. deviation (A) 


0.006 


0.006 




Angle r.m.s. deviation (°) 


1.05 


1.05 




Average B factor (A 2 ) 


34.0 


40.4 



3 Rsym "Si I Is- <I»> I / Zili where <I-,> is the average intensity 
over symmetry equivalents 
25 b Rcryst^ZIFo-Fcl/ZIFol 



C. Structure of hERa LBD-OHT complex 

The OHT complex data set was then collected. Starting with one of the monomers of the 
30 preliminary low-resolution DES-hERa LBD-NR-box 2 peptide model as the search probe, 
molecular replacement in AMoRe was used to search for the location of LBD in this crystal form in 
both P6i22 and P6 5 22. A translation search in P6 5 22 yielded the correct solution (R=53.8%, 
CC=38.2°/o). In order to reduce model bias, DMMULTI (CCP4, 1994) was then used to project 
averaged density from the DES complex cell into the OHT complex cell. Using MOLOC, a model 
35 of the hERa LBD was built into the resulting density. The model was refined initially in REFMAC 
and later with the simulated annealing, positional and B-factor refinement protocols in X-PLOR 
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5 (Brunger, X-PLOR Version 3.843, New Haven, Connecticut: Yale University, 1996) using a 
maximum-iikelihood target (Adams, et al., Proc. Natl Acad Set USA (1997) 94:5018-23). 
Anisotropic scaling and a bulk solvent correction were used and all B-factors were refined 
isotropically. Except for the R fr ee set (a random sampling consisting of 8% of the data set), all data 
between 41 and 1.9 A (with no a cutoff) were included. The final model consisted of residues 306- 

10 551, the ligand and 78 waters. According to PROCHECK (CCP4, 1994), 91.6% of all residues in 
the model were in the core regions of the Ramachandran plot and none were in the disallowed 
regions. Thus, the structure of the OHT-hERa LBD complex has been refined against data of 
comparable resolution (1.90 A) to a crystallographic R-factor of 23.0% (Rfr ee ==26.2%). Atomic 
coordinates of OHT-hERa LBD complex are attached as Appendix 3. 

15 Example 17: Structural analysis of hTRIJ LBD:GRIP 1 NR-box 2 peptide complex 
A. Structure of cocrvstal complex (contents of asu^ 

The asymetric unit (asu) of the crystal contains two monomers of the hTRB LBD and two 
molecules of the GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ 
ID NO: 6), which observes the NCS relation of the two TR monomers (Figure 12). The structure 
20 of the hTRB LBD, which closely resembles that of the rTRa LBD (Wagner et al., supra\ consists 
of twelve alpha-helices and two fi-strands organized in three layers, resembling an alpha-helical 
sandwich. The only significant difference between the hTRB LBD and the rTRa LBD is disorder in 
the loop between helices HI and H3. The ORIP1 NR-box 2 peptide forms an amphipathic a-helix 
of about 3 turns, preceded by 2 residues and followed by 3 residues in extended coil conformation. 

25 The relation of the two monomers of the hTRB LBD is primarily translational, and does not 

resemble the homodimer structures reported for the hRXR, or the hER (Bourguet et al., supra : 
Brzozowski et al., suvrd ). Furthermore, the interface between the two monomers does not involve 
residues necessary for formation of the physiological TR dimer. Instead, one of the cocrystal 
peptides appears to bridge the interaction between the two monomers. The hydrophobic face of the 

30 alpha-helix of the cocrystal peptide contacts monomer 1 of the hTRB LBD at H3, H5, and HI 2, 
while the hydrophilic face contacts monomer 2 at the hairpin turn preceding strand S3. The second 
cocrystal peptide also contacts monomer 2 at H3, H5, and H12, and the two cocrystal peptides 
observe the same NCS relation as TR LBD monomers. 
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5 The common interface between both cocrystal peptides and the hTRIi LBD buries the 

hydrophobic residues that define the cocrystal peptide (SEQ ID NO: 1) LxxLL sequence motif, 
residues Ile689, Leu690, Leu693, and Leu694; against the surface of the receptor LBD (Figures 16 
and 17). The presence of the second peptide in the crystal, duplicating the interactions of the 
hydrophobic residues, suggests those interactions are specific and drive the interaction of the 
10 peptide with the hTRB LBD, while the hydrophilic interactions provide a fortuitous crystal contact 
and account for the dependence of crystallization on the presence and concentration of the peptide. 

B. Structure of the GRIP1 NR-box 2 peptide 

The GRIP1 NR-box 2 peptide used in the crystallization is 13 amino acids long (residues 
12-24 of SEQ ID NO: 6; 686-KHKILHRLLQDSS-698). For the NR-box 2 peptide in monomer 1 

15 (peptide 1), 12 amino acids are ordered in the crystal. Residues K688 - Q694 form an amphipathic 
helix, with residues K686-H687 and D695-S698 on either end in extended coil conformations. For 
the NR-box 2 peptide in monomer 2 (peptide 2), residues K688 - Q694 again form an amphipathic 
helix, but the ends of the peptide are disordered. While the resolution of the current data prevents 
absolute assignment of hydrogen bonds, it is evident from the periodicity of the side chain density 

20 that the central residues form an alpha-helix. In the absence of TR the far UV-CD spectrum of the 
GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) appears 
to be random coil (data not shown). Stable helix formation may thus be induced by the interaction 
of the hydrophobic amino acids with the receptor LBD as it has been proposed in other 
proteinrprotein interactions, such as p53:MDM2 (Kussie et al., Science (1996) 274:948-953), 

25 VP1 6:TAF3 1 (Uesugi et al., Science (1 996) 2 77: 1 3 1 0-1 3 1 3), and CREB:KIX-CBP (Radhakrishnan 
et al., Cell (1997) P7:741-752). 

C- Structure of the hTRB LBD:GRTP1 NR-box 2 peptide interface 

The hTRB LBD of the cocrystal contributes residues from three helices, H3, H5, and HI 2 to 
the interface, which pack against one another to create a hydrophobic cleft. The residues lining the 
30 cleft are 1280, T281, V283, V284, A287, and K288 from H3; Q301, 1302, L305, and K306 from 
H5; and L454, E457, V458, and F459 from HI 2. A cysteine residue (C309) from H6 appears to 
provide a partial surface that is buried deep within the bottom of the cleft. 

The GRIP1 NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 
6) binds at the junction of H3 and HI 2. Leu690 of the bound peptide inserts into a shallow but 

45 



BNSDOCID; <WO. 



.996001 4A3_IA> 



WO 99/060014 



PCT/US99/06899 



5 defined depression at the base of the cleft, making van der Waals contact with L454 and V458 of 
H12, while peptide residue Ile689 packs against L454 of H12 outside the edge of the cleft; L454, 
then, interdigitates between the two residues. One further turn C-terminal along the alpha-helix, 
L693 and L694 of the bound peptide pack into complementary pockets within the hydrophobic 
cleft. Peptide residue L693 forms van der Waals contact with V284 of H3, while peptide residue 

10 L694, bound more deeply in the cleft, makes contact with F298 and L305 of H4 and H5. The 
hydrophobic interactions of the GRIP1 NR-box 2 peptide with the hTRfl LBD are observed for both 
cocrystal peptides 1 and 2 in their respective monomers of the crystal dimer complex, suggesting 
that the interactions are specific to the peptide, and not induced by crystallization. 

Example 18: Overall Structure of the DES-hERcc-LBD-NR-box 2 Peptide Complex 

15 The asymmetric unit of the DES-hERot LBD-NR-box 2 peptide 686-KHBCILHRLLQDSS- 

698 (residues 12-24 of SEQ ID NO: 6) complex crystals contains the same noncrystallographic 
dimer of LBDs that has been observed in the previously determined structures of the LBD bound to 
both E2 and RAL (Brzozowski, et al., supra and Tanenbaum, et al., supra). Beyond the flexible 
loops between helices 2 and 3 and helices 9 and 10, the two LBDs of the dimer adopt similar 

20 structures (r.m.s.d. 0.47 A based on Cot positions). The conformation of each LBD complexed with 
DES closely resembles that of the LBD bound to E2 (Brzozowski, et al., supra) : each monomer is a 
wedge shaped molecule consisting of three layers of eleven to twelve helices and a single beta 
hairpin. In each LBD, the hydrophobic face of helix 12 is packed against helices 3, 5/6 and 11 
covering the ligand binding pocket. One NR-box 2 peptide is bound to each LBD in a hydrophobic 

25 cleft composed of residues from helices 3, 4, 5 and 12 and the turn between 3 and 4. The density 
for both peptides in the asymmetric unit is continuous and unambiguous. Residues 687 to 697 from 
peptide A and residues 686 to 696 from peptide B have been modeled; the remaining residues are 
disordered. Given that each peptide lies within a different environment within the crystal, it is 
striking that from residues He 689 to Gin 695 each peptide forms a two turn, amphipathic a helix. 

30 Flanking this region of common secondary structure, the peptides adopt dissimilar random coil 
conformations. 

Example 19: Structure of the OHT-hERcc LBD Complex 

The binding of OHT induces a conformation of the hERoc LBD that differs in both 
secondary and tertiary structural organization from that driven by DES binding. In the DES 
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5 complex, the main chain from residues 339 to 341, 421 to 423, and 527 to 530 form parts of helices 
3, 8 and 1 1 respectively. In contrast, these regions adopt an extended conformation in the OHT 
complex. In addition, the composition and orientation of helix 12 are different in the two structures. 
Helix 12 in the DES complex consists of residues 538 to 546 whereas helix 12 in the OHT complex 
consists of residues 536 to 544. Most dramatically, rather than covering the ligand binding pocket 
10 as it does in the DES complex, helix 12 in the OHT complex occupies the part of the coactivator 
binding groove formed by residues from helices 3, 4, and 5, and the turn connecting helices 3 and 4. 
This alternative conformation of helix 12 appears to be similar to that observed in the RAL complex 
(Brzozowski, et aL, supra). 

Example 20: Coactivator binding site structure and function 

15 A. TR coactivator binding site 

The above examples demonstrate that nuclear receptors, exemplified by TR, GR and ER, are 
recognized by specific coactivators that bind thereto through a coupling surface comprising a 
hydrophobic cleft and a charged hydrophobic perimeter. Identification and characterization of this 
coupling surface and the coactivator binding site of nuclear receptors offers a new target for the 
20 design and selection of compounds that modulate binding of coactivator to nuclear receptors. 

Residues forming the coactivator binding site were found to cluster within a surprisingly 
small area with well-defined borders (see, e.g., Figures 5, 14, and 15). As is shown in above 
Examples, mutated residues nearby this area do not affect coactivator binding or transcriptional 
activation. Additionally, the coactivator binding assays and structural analyses demonstrated that 
25 NR-box containing proteins and peptides bind to this site. These results also showed that the 
GRIP1 coactivator protein binds to the site through a highly (SEQ ID NO: 1) LxxLL. 

The structural analyses showed that residues contacting a conserved leucine residue of the 
(SEQ ID NO: 1) LxxLL motif included V284, F293, 1302, L305 and L454. Residues within 4.5A 
of an atom of the bound peptide included T281, V284, K288, F293, Q301, 1302, L305, K306, P453, 

30 L454 and E457. Structural analyses also revealed two other features of the site: a hydrophobic 

residue from helix 12 (Phe459) that contributes to local packing, and a cysteine residue contributed 
by helix 6 (Cys309) that provides a partial surface buried deep within the site. Mutational analyses 
showed that residues which block GRIP1 and SRC-1 coactivator binding when mutated are residues 
V284, K288, 1302, K306, L454, and V458. Mutated residues likely to undergo a conformational 

35 change upon hormone binding included Leu454 and Glu457. Thus, the site identified by 

47 



BNSDOCID; <WO 996001 4A3_IA> 



WO 99/060014 



PCT/US99/06899 



5 mutational, binding assays and crystallography corresponds to a surprisingly small cluster of 
residues on the surface of the LBD that define a prominent hydrophobic cleft formed by 
hydrophobic residues corresponding to human TR residues of C-terminal helix 3 (Ile280, Val283, 
Val284, and AJa287), helix 4 (Phe293), helix 5 (Ile302 and Leu305), helix 6 (Cys309), and helix 12 
(Leu454, Val458 and Phe459). Collectively, the Examples indicate that residues forming the site 

10 are amino acids corresponding to human TR residues of C-terminal helix 3 (Ile280, Thr281, 

Val283, Val284, Ala287, and Lys288), helix 4 (Phe293), helix 5 (Gln301, Ile302, Leu305, Lys306), 
helix 6 (Cys309), and helix 12 (Pro453, Leu454, Glu457, Val458 and Phe459). The coactivator 
binding site is highly conserved among the nuclear receptor super family (Figure 19). 

The coactivator binding site of TR contains charged and hydrophobic residues at its 
15 periphery, but only hydrophobic residues at its center (see, e.g., Figures 5 and 18). The 
hydrophobic cleft at the center of the site may play a significant role in driving the coactivator 
binding reaction. The site is comprised of two parts (Figure 18), right). Residues contained in 
helices 3, 5 and 6 (Figure 18, yellow residues) likely form a constitutive part, since their positions 
are identical in all nuclear receptor structures reported, including the liganded, activated states of 
20 the TR, RAR, and ER, the unliganded RXR, and the inhibitor-liganded ER. By contrast, the 
residues of helix 12 (Figure 18, red residues) are differently positioned in the active and inactive 
states reported. Thus the coactivator binding site for the nuclear receptors is likely to be formed in 
response to an* active hormone by positioning helix 12 against a scaffold formed by helices 3-6. 
Because the coactivator binding site is so small, it is easy to understand how even slight changes in 
25 the position of helix 12, which may, for example, be induced by an antagonist ligand, could impair 
coactivator binding, and thus receptor activation. 

B„ ER coactivator binding site 

Binding of the NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID 
NO: 6) to the ERoc LBD buries 1000 A 2 of predominantly hydrophobic surface area from both 

30 molecules. The NR-box 2 peptide binding site is a shallow groove composed of residues Leu 354, 
Val 355, He 358, Ala 361 and Lys 362 from helix 3; Phe 367 and Val 368 from helix 4; Leu 372 
from the turn between helices 3 and 4; Gin 375, Val 376, Leu 379 and Glu 380 from helix 5; and 
Asp 538, Leu 539, Glu 542 and Met 543 from helix 12. The floor and sides of this groove are 
completely nonpolar, but the ends of this groove are charged. Therefore, structural characterization 

35 of the binding site of the NR-box 2 peptide 686-KHKILHRLLQDSS-698 (residues 12-24 of SEQ 
ID NO: 6) to the ERa LBD, which is the same NR-box 2 peptide utilized to crystallize the T 3 -TR 
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5 LBD, supports the findings for TR that residues forming the coactivator binding site of nuclear 
receptors is composed of a well defined hydrophobic cleft and a charged hydrophobic perimeter. 
These residues are highly conserved among the nuclear receptor super family (Figure 19). 
Structural characterization of the coactivator peptide-bound ER LBD also supports the concept of 
exploiting the slight differences among the coactivator binding sites of nuclear receptors in 
1 0 designing and identifying compounds that target specific nuclear receptors. 

The ERa LBD interacts primarily with the hydrophobic face of the NR-box 2 peptide 686- 
KHKILHRLLQDSS-698 (residues 12-24 of SEQ ID NO: 6) a helix formed by the side chains of 
He 689 and the three (SEQ ID NO: 1) LxxLL motif leucines (Leu 690, Leu 693 and Leu 694). The 
side chain of Leu 690 is deeply embedded within the groove and forms van der Waals contacts with 

15 the side chains of lie 358, Val 376, Leu 379, Glu 380 and Met 543. The side chain of Leu 694 is 
similarly isolated within the groove and makes van der Waals contacts with the side chains of He 
358, Lys 362, Leu 372, Gin 375, Val 376 and Leu 379. In contrast, the side chains of both He 689 
and the second NR box leucine, Leu 693, rest against the rim of the groove. The side chain of He 
689 lies in a shallow depression formed by the side chains of Asp 538, Leu 539 and Glu 542. The 

20 side chain of Leu 693 makes nonpolar contacts with the side chains of lie 358 and Leu 539. 

The charged and polar side chains which form the hydrophilic face of the peptide helix 
project away from the ERa receptor and either interact predominantly with solvent or form 
symmetry contacts. None of the side chains of the polar and charged residues outside the helical 
region of either peptide in the asymmetric unit, with the exception of Lys 688 of peptide B, is 
25 involved in hydrogen bonds or salt bridges with its associated ERa LBD monomer. The e-amino 
group of Lys 688 of peptide B hydrogen bonds to the side chain carboxylate of Glu 380 of 
monomer B. This interaction is presumably a crystal artifact; the main chain atoms of the N- 
terminal three residues of peptide B are displaced from monomer B and interact extensively with a 
symmetry-related ERa LBD. 

30 In addition to interacting with the hydrophobic face of the peptide helix, the ERa LBD 

stabilizes the main chain conformation of the NR box peptide by forming capping interactions with 
both ends of the peptide helix. Glu 542 and Lys 362 are positioned at opposite ends of the peptide 
binding site. The side chains of Glu 542 and Lys 362 form van der Waals contacts with main chain 
and side chain atoms at the N- and C-tenninal turns of the peptide helix respectively. These 

35 interactions position the stabilizing charges of the y-carboxylate of Glu 542 and e-amino group of 
Lys 362 near the ends of the NR box peptide helix. The side chain carboxylate of Glu 542 
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5 hydrogen bonds to the amides of the residues of N-terminal turn of the peptide helix (residues 688 
and 689 of peptide A; residues 689 and 690 of peptide B). Similarly, the s-amino group of Lys 362 
hydrogen bonds to the carbonyls of the residues of the C-terminal turn of the peptide helix (residue 
693 of peptide A; residues 693 and 694 of peptide B). 

Except for the orientation of helix 12, the structure of the peptide binding groove of the ERa 
10 LBD is almost identical in the DES and OHT complexes. The region of this groove outside of helix 
12 is referred to herein as the "static region" of the NR box binding site. Helix 12 in the OHT 
complex and the NR box peptide helix in the DES complex interact with the static region of the 
coactivator recognition groove in strikingly similar ways. 

Helix 12 mimics the hydrophobic interactions of the NR box peptide with the static region 
15 of the groove with a stretch of residues (residues 540 to 544) that resembles an NR box ((residues 6- 
10 of SEQ ID NO: 43) LLEML instead of (SEQ ID NO: 1) LxxLL). The side chains of Leu 540 
and Met 543 lie in approximately the same locations as those of the first and second motif leucines 
(Leu 690 and Leu 693) in the peptide complex. Leu 540 is inserted into the groove and makes van 
der Waals contacts with Leu 354, Val 376 and Glu 380. Met 543 lies along the edge of the groove 
20 and forms van der Waals contacts with the side chains of Leu 354, Val 355 and He 358. The side 
chain position of Leu 544 almost exactly overlaps that of the third NR box leucine, Leu 694. Deep 
within the groove, the Leu 544 side chain makes van der Waals contacts with the side chains of He 
358, Lys 362, Leu 372, Gin 375, Val 376 and Leu 379. 

Helix 12 in the OHT complex is also stabilized by N- and C-terminal capping interactions. 

25 Lys 362 interacts with the C-terminal turn of helix 12 much as it does with the equivalent turn of 
the peptide helix. The Lys 362 side chain packs against the C-terminal turn of the helix 12 with its 
£-amino group hydrogen bonding to the carbonyls of residues 543 and 544. Given that the capping 
interaction at the N-terminal turn coactivator helix is formed by a helix 12 residue (Glu 542), the N- 
terminal turn of helix 12 in the antagonist complex is forced to interact with another residue, Glu 

30 380. The Glu 380 y-carboxylate forms van der Waals contacts with Tyr 537 and interacts with the 
amide of Tyr 537 through a series of water-mediated hydrogen bonds. 

In addition to forming these "NR box-like" interactions, helix 12 also forms van der Waals 
contacts with areas of the ERa LBD outside of the coactivator recognition groove. The side chain 
of Leu 536 forms van der Waals contacts with Glu 380 and Tip 383 and that of Tyr 537 forms van 
35 der Waals contacts with His 373, Val 376 and Glu 380. As a result of these contacts, helix 12 in the 
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5 OHT complex buries more solvent accessible surface area (-1200 A 2 ) than the NR box peptide in 
the DES-ERa LBD-peptide complex. 

Identification and characterization of the coactivator binding site for TR, and extension of 
this information to other nuclear receptors shows that this site is common for all nuclear receptors 
identified to date. Additionally, sequence and structural comparison, coupled with the Examples 

10 showing differential specificity for coactivator binding to TR, GR and ER, reveal that minor 

differences between the receptors, such as found in helix 12, are likely to influence specificity of a 
coactivator for different types of nuclear receptors. Thus, the Examples presented herein 
demonstrate that information derived from the structure and function of the TR coactivator binding 
site can be applied in design and selection of compounds that modulate binding of coactivator 

1 5 proteins to nuclear receptors for all members of the nuclear receptor super family. 
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Atomic Coordinates for Human TR-B Complexed With T 3 , and a GRIP1 NR-box 2 Peptide 
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ATOM 181 CD2 PHE A 286 25.931 29.218 39.199 1.00 33.83 

ATOM 182 CE1 PHE A 286 27.331 26.860 39.161 1.00 39.55 

ATOM 183 CE2 PHE A 286 27.307 29.233 39.387 1.00 38.08 

ATOM 184 CZ PHE A 286 28.008 28.052 39.371 1.00 34.44 

ATOM 185 C PHE A 286 23.631 27.105 41.083 1.00 36.83 

ATOM 186 O PHE A 286 24.317 26.192 41.504 1.00 35.61 

ATOM 187 N ALA A 287 23.393 28.228 41.752 1.00 37.33 

ATOM 188 CA ALA A 287 23.917 28.448 43.087 1.00 36.34 

ATOM 189 CB ALA A 287 23.523 29.828 43.555 1.00 36.40 

ATOM 190 C ALA A 287 23.346 27.393 44.027 1.00 38.76 

ATOM 191 O ALA A 287 23.994 26.973 44.981 1.00 41.98 

ATOM 192 N LYS A 288 22.114 26.979 43.735 1.00 38.28 

ATOM 193 CA LYS A 288 21.429 25.971 44.538 1.00 45.26 

ATOM 194 CB LYS A 288 19.994 25.746 44.054 1.00 48.35 

ATOM 195 CG LYS A 288 19.025 26.819 44.464 1.00 51.43 

ATOM 196 CD LYS A 288 17.628 26.246 44.682 1.00 60.23 

ATOM 197 CE LYS A 288 17.135 25.478 43.485 1.00 62.81 

ATOM 198 NZ LYS A 288 17.196 26.327 42.268 1.00 64.69 

ATOM 199 C LYS A 288 22.120 24.632 44.536 1.00 43.31 

ATOM 200 O LYS A 288 21.967 23.857 45.462 1.00 45.66 

ATOM 201 N LYS A 289 22.865 24.366 43.467 1.00 41.70 

ATOM 202 CA LYS A 289 23.571 23.120 43.351 1.00 40.67 

ATOM 203 CB LYS A 289 23.655 22.708 41.877 1.00 42.25 

ATOM 204 CG LYS A 289 22.271 22.492 41.247 1.00 39.53 

ATOM 205 CD LYS A 289 22.331 21.606 40.012 1.00 43.19 

ATOM 206 CE LYS A 289 20.941 21.362 39.447 1.00 45.74 

ATOM 207 N2 LYS A 289 20.273 20.165 40.006 1.00 52.49 

ATOM 208 C LYS A 289 24.948 23.185 44.003 1.00 41.50 

ATOM 209 O LYS A 289 25.642 22.184 44.080 1.00 39.77 

ATOM 210 N LEU A 290 25.312 24.370 44.490 1.00 40.68 

ATOM 211 CA LEU A 290 26.594 24.583 45.149 1.00 39.33 

ATOM 212 CB LEU A 290 27.15.3 25.972 44.829 1.00 36.14 

ATOM 213 CG LEU A 290 27.358 26.290 43.365 1.00 34.81 

ATOM 214 CD1 LEU A 290 27.945 27.675 ' 43.208 1.00 29.07 

ATOM 215 CD2 LEU A 290 28.267 25.242 42.757 1.00 33.45 

ATOM 216 C LEU A 290 26.434 24.405 46.652 1.00 40.08 

ATOM 217 O LEU A 290 25.803 25.235 47.333 1.00 42.00 

ATOM 218 N PRO A 291 27.028 23.333 47.210 1.00 40.27 

ATOM 219 CD PRO A 291 27.851 22.330 46.519 1.00 39.65 

ATOM 220 CA PRO A 291 26.905 23.096 48.659 1.00 38.28 

ATOM 221 CB PRO A 291 27.755 21.860 48.911 1.00 35.88 

ATOM 222 CG PRO A 291 28.202 21.355 47.585 1.00 34.19 

ATOM 223 C PRO A 291 27.327 24.298 49.522 1.00 40.05 

ATOM 224 O PRO A 291 26.571 24.739 50.391 1.00 41.33 

ATOM 225 N MET A 292 28.522 24.843 49.299 1.00 40.59 

ATOM 226 CA MET A 292 29.021 25.957 50.097 1.00 42.86 

ATOM 227 CB MET A 292 30.313 26.475 49.477 1.00 43.28 

ATOM 228 CG MET A 292 31.269 25.378 49.050 1.00 50.35 

ATOM 229 SD MET A 292 ' 32.895 26.096 48.757 1.00 51.17 

ATOM 230 CE MET A 292 33.812 24.647 48.074 1.00 54.63 

ATOM 231 C MET A 292 27.984 27.066 50.149 1.00 41.05 

ATOM 232 O MET A 292 27.986 27.886 51.057 1.00 39.66 

ATOM 233 N PHE A 293 27.080 27.078 49.172 1.00 39.30 

ATOM 234 CA PHE A 293 26.030 28.091 49.114 1.00 40.92 
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44. 


99 


8 


ATOM 


1361 


C 


SER 


A 


437 


21 


.110 


52. 


158 


36. 


055 


1. 


00 


38. 


39 


6 


ATOM 


1362 


O 


SER 


A 


437 


20 


.049 


52. 


745 


36. 


204 


1. 


00 


37. 


54 


8 


ATOM 


1363 


N 


ARG 


A 


438 


21 


.432 


51. 


483 


34. 


956 


1. 


00 


37. 


32 


7 


ATOM 


1364 


CA 


ARG 


A 


438 


20 


.534 


51. 


379 


33. 


821 


1. 


00 


39. 


30 


6 


ATOM 


1365 


CB 


ARG 


A 


438 


21 


.114 


50. 


402 


32. 


786 


1. 


00 


42. 


97 


6 


ATOM 


1366 


CG 


ARG 


A 


438 


22 


.343 


50. 


911 


32. 


051 


1. 


00 


41. 


72 


6 


ATOM 


1367 


CD 


ARG 


A 


438 


21 


.955 


52. 


134 


31. 


251 


1. 


00 


45. 


23 


6 


ATOM 


1368 


NE 


ARG 


A 


438 


20 


.964 


51. 


839 


30. 


237 


1. 


00 


45. 


66 


7 



BNSDOC1D: <WO 996001 4A3_IA> 



86 

SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ATOM 


1369 


CZ 


ARG 


A 


4 38 


20 


.063 


52 


7i«ays9 W 




6 


ATOM 


1370 


NH1 


ARG 


A 


438 


20 


.046 


53 


.958 


30 


.318 


i 


-00 


50 


. 91 


7 


ATOM 


1371 


NH2 


ARG 


A 


438 


19 


. 198 


52 


.354 


28 


.865^ 


l 


.00 


46 


.86 


7 


ATOM 


1372 


C 


ARG 


A 


438 


19 


.147 


50 


.922 


34 


.240^ 


\ 






.37 


6 


ATOM 


1373 


0 


ARG 


A 


438 


18 


.147 


51 


.297 


33 


.625 


i 


.00 


40 


.58 


8 


ATOM 


1374 


N 


PHE 


A 


439 


19 


. 080 


50 


.120 


35 


.298 


i 


.00 


42 


.25 


7 


ATOM 


1375 


CA 


PHE 


A 


439 


. 17 


.803 


49 


.624 


35 


.763 


i 


.00 


42 


.81 


6 


ATOM 


1376 


CB 


PHE 


A 


439 


17 


.975 


48 


.794 


37 


.013 


i 


.00 


42 


. 18 


6 


ATOM 


1377 


CG 


PHE 


A 


439 


16 


.739 


48 


.053 


37 


.413 


i 


.00 


42 


. 48 


6 


ATOM 


1378 


CD1 


PHE 


A 


439 


16 


.198 


47 


.111 


36 


.562 


i 


.00 


47 


. 09 


6 


ATOM 


1379 


CD2 


PHE 


A 


439 


16 


. 105 


48 


-320 


38 


. 613 


i 


. 00 


39 


.76 


6 


ATOM 


1380 


CE1 


PHE 


A 


439 


15 


.047 


46 


.427 


36 


.905 


i 


.00 


49 


. 17 


6 


ATOM 


1381 


CE2 


PHE 


A 


439 


14 


.940 


47 


.630 


38 


.963 


i 


.00 


45 


. 10 


6 


ATOM 


1382 


CZ 


PHE 


A 


439 


14 


. 411 


46 


. 683 


38 


.098 


i 


. 00 


46 


. 36 


6 


ATOM 


1383 


C 


PHE 


A 


439 


16 


.921 


50 


.803 


36 


.075 


i 


. 00 


44 


.79 


6 


ATOM 


1384 


O 


PHE 


A 


439 


15 


. 830 


50 


.903 


35 


.554 


i 


.00 


40 


.26 


8 


ATOM 


1385 


N 


LEU 


A 


440 


17 


.410 


51 


.681 


36 


.951 


i 


. 00 


42 


.77 


7 


ATOM 


1386 


CA 


LEU 


A 


440 


16 


.660 


52 


.871 


3.7 


. 344 


i 


. 00 


42 


. 96 


6 


ATOM 


1387 


CB 


LEU 


A 


440 


17 


.546 


53 


.824 


38 


.150 


i 


.00 


37 


. 19 


6 


ATOM 


1388 


CG 


LEU 


A 


440 


17 


.943* 


53 


.297 


39 


. 500 


i 


. 00 


36 


. 97 


6 


ATOM 


1389 


CD1 


LEU 


A 


440 


18 


.620 


54 


.38 9 


40 


. 316 


i 


.00 


33 


. 65 


6 


ATOM 


1390 


CD2 


LEU 


A 


440 


16 


.679 


52 


.837 


40 


.216 


i 


. 00 


35 


. 42 


6 


ATOM 


1391 


C 


LEU 


A 


440 


16 


.025 


53 


.596 


36 


.168 


i 


. 00 


45 


- 47 


6 


ATOM 


1392 


O 


LEU 


A 


440 


14 


.809 


53 


.750 


36 


. 126 


i 


. 00 


52 


. 4 8 


8 


ATOM 


1393 


N 


HIS 


A 


441 


16 


.836 


54 


.060 


35 


.223 


i 


. 00 


49 


. 15 


7 


ATOM 


1394 


CA 


HIS 


A 


441 


16 


.277 


54 


. 725 


34 


.0 63 


i 


. 00 


54 


. 7 6 


6 


ATOM 


1395 


CB 


HIS 


A 


441 


17 


.329 


54 


.955 


33 


. 031 


i 


. 00 


56 


. 68 


6 


ATOM 


1396 


CG 


HIS 


A 


441 


18 


.134 


56 


.161 


33 


.282 


i 


. 00 


62 


. 73 


6 


ATOM 


1397 


CD2 


HIS 


A 


441 


18 


.468 


57 


.216 


32 


. 499 


i 


. 00 


65 


. 73 


6 


ATOM 


1398 


ND1 


HIS 


A 


441 


18 


.701 


56 


.431 


34 


.538 


i 


. 00 


66 


. 01 


7 


ATOM 


1399 


CE1 


HIS 


A 


441 


19 


.332 


57 


.594 


34 


.473 


i 


. 00 


65 


. 55 


6 


ATOM 


1400 


NE2 


HIS 


A 


441 


19 


.205 


58 


.085 


33 


.255 


i 


. 00 


60 


. 09 


7 


ATOM 


1401 


C 


HIS 


A 


441 


15 


.244 


53 


.822 


33 


.481 


i 


. 00 


55 


. 93 


6 


ATOM 


1402 


O 


HIS 


A 


441 


14 


.149 


54 


.263 


33 


.170 


i 


. 00 


57 


. 33 


8 


ATOM 


1403 


N 


MET 


A 


442 


15 


.605 


52 


.549 


33 


.313 


i 


. 00 


57 


. 81 


7 


ATOM 


1404 


CA 


MET 


A 


442 


14 


.661 


51 


.583 


32 


.778 


i 


. 00 


59 , 


. 11 


6 


ATOM 


1405 


CB 


MET 


A 


442 


15 


.191 


50 


.154 


32 


.922 


i 


. 00 


55. 


. 93 


6 


ATOM 


1406 


CG 


MET 


A 


442 


16 


.336 


49 


.813 


32 


.022 


i , 


. 00 


58 , 


. 52 


6 


ATOM 


1407 


SD 


MET 


A 


442 


16 


.681 


48 


.008 


31 


.851 


i , 


. 00 


60 , 


. 99 


16 


ATOM 


1408 


CE 


MET 


A 


442 


17 


.085 


47 


.602 


33 


.581 


i , 


. 00 


52 , 


. 61 


6 


ATOM 


1409 


C 


MET 


A 


442 


13 


.339 


51. 


.727 


33 


.534 


i , 


. 00 


60 , 


. 31 


6 


ATOM 


1410 


O 


MET 


A 


442 


12 


.266 


51. 


.560 


32, 


.968 


i « 


. 00 


58 , 


. 18 


8 


ATOM 


1411 


N 


LYS 


A 


443 


13 


.425 


52, 


.054 


34, 


.818 


i . 


. 00 


61 . 


. 45 


7 


ATOM 


1412 


CA 


LYS 


A 


443 


12 


.236 


52, 


.202 


35. 


.626 


i. 


. 00 


64. 


. 90 


6 


ATOM 


1413 


CB 


LYS 


A 


443 


12 


.608 


52, 


.141 


37. 


.090 


i . 


. 00 


64 . 


. 40 


6 


ATOM 


1414 


CG 


LYS 


A 


443 


11 


.461 


51, 


.748 


37. 


.959 


i . 


. 00 


69. 


. 12 


6 


ATOM 


1415 


CD 


LYS 


A 


443 


12, 


.068 


51. 


.551 


39, 


.257 


i . 


. 00 


71 . 


,14 


6 


ATOM 


1416 


CE 


LYS 


A 


443 


11. 


.368 


51. 


.897 


40. 


.091 


i , 


, 00 


73 . 


.43 


6 


ATOM 


1417 


NZ 


LYS 


A 


443 


11, 


.883 


51, 


.712 


41. 


.415 


i. 


. 00 


67. 


, 97 


7 


ATOM 


1418 


C 


LYS 


A 


443 


11 


.513 


53, 


.514 


35. 


.348 


i. 


00 


67. 


,29 


6 


ATOM 


1419 


O 


LYS 


A 


443 


10. 


.390 


53. 


.700 


35. 


.780 


i. 


00 


67. 


90 


8 


ATOM 


1420 


N 


VAL 


A 


444 


12. 


.171 


54. 


.429 


34. 


,629 


i. 


00 


66. 


57 


7 


ATOM 


1421 


CA 


VAL 


A 


444 


11 . 


.575 


55, 


,719 


34. 


297 


i . 


00 


64 . 


76 


6 


ATOM 


1422 


CB 


VAL 


A 


444 


12 . 


.569 


56. 


,869 


34. 


,560 


i. 


00 


62 . 


76 


6 



87 



BNSDOCID: <WO 



.99600 14A3_IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



1423 

1424 

1425 

1426 

1427 

1428 

1429 

1430 

1431 

1432 

1433 

1434 

1435 

1436 

1437 

1438 

1439 

1440 

1441 

1442 

1443 

1444 

1445 

1446 

1447 

1448 

1449 

1450 

1451 

1452 

1453 

1454 

1455 

1456 

1457 

1458 

1459 

1460 

1461 

1462 

1463 

1464 

1465 

1466 

1467 

1468 

1469 

1470 

1471 

1472 

1473 

1474 

1475 

1476 



CGI VAL 
CG2 VAL 



C 
O 
N 

CA 
CB 
CG 
CD 



VAL 
VAL 
GLU 
GLU 
GLU 
GLU 
GLU 



OE1 GLU 
OE2 GLU 



C 

O 

N 

CA 

CB 

SG 

C 

O 

N 

CD 

CA 

CB 

CG 

C 

O 

N 

CA 
CB 



OG1 THR 
CG2 THR 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 

CYS A 446 
CYS A 44 6 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



GLU 
GLU 
CYS 



CYS 
CYS 
CYS 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
PRO 
THR 
THR 
THR 



C THR 

O THR 

N GLU 

CA GLU 

CB GLU 

CG GLU 

CD GLU 

OE1 GLU 

OE2 GLU A 449 

C GLU A 44 9 

O GLU 

N LEU 

CA LEU 

CB LEU 

CG LEU 



444 
444 
444 
444 
445 
445 
445 
445 
445 
445 
445 
445 
445 
446 



446 
446 
446 
447 
447 
447 
447 
447 
447 
447 
448 
448 
448 
448 
448 
448 
448 
449 
449 
449 
449 
449 
449 



CD1 LEU 
CD2 LEU 



C 
O 
N 

CA 
CB 
CG 



LEU 
LEU 
PHE 
PHE 
PHE 
PHE 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



449 
450 
450 
450 
450 
450 
450 
450 
450 
451 
451 
451 
451 



11-952 
12.999 
11.043 
9.937 
11.814 
11.457 
12.725 
13.598 
14.875 
15.155 
15. 663 
10.724 
10.536 
10.301 
9.628 
10.595 
12.009 
8.454 
8.495 
7.372 
7.267 
6.150 
5.187 
5.875 
6. 435 
7.181 
5.820 
6.024 
5.528 
4.105 
6.192 



5, 
4, 
4, 
3, 



.113 
.915 
.539 
.630 
2,600 
2.145 
.297 
,479 
.424 
.434 
.882 
.747 
.608 
.301 
.4 64 
.337 
5.786 
7.669 
8.427 
7.705 
8.681 
9.041 
9.873 



1 
1 
0. 
4 

3, 
5. 
6. 
7 . 
6. 
7. 



58.195 

56. 891 

55.730 

56.210 

55.173 

55.152 

55.255 

56.429 

56.472 

55.565 

57.430 

53.912 

53.701 

53.099 

51.899 

50.719 

50. 842 

51.671 

52.014 

51.133 

50.764 

50.853 

50.281 

50.271 

49.843 

48.908 

50.002 

49.066 

49.734 

49.917 

51.081 

47.912 

46.995 

48.021 

47.023 

47.773 

47.051 

47.889 

49.137 

47.322 

46.036 

45.142 

46.224 

45.347 

46.154 

46.819 

47.662 

45.750 

44. 673 

43.841 

45.039 

44.506 

45.562 

46.717 



36.035 
32.861 
32.612 
31.935 
30.514 
29.664 
30.022 
29.239 
28.414 
29.430 
30.049 
28.844 
30. 999 
30. 634 
30. 687 
29.573 
31.535 
32. 728 
30.978 
29.560 
31.740 
30.714 
29. 437 
32.831 
32.612 
33.997 
35. 113 
36. 401 
36.328 
36. 585 
34 .755 
35.519 
33.565 
33.024 
32.191 
31.001 
30.178 
30.146 
29.497 
32.179 
31.559 
32.161 
31.378 
30.277 
29.217 
28.328 
28.415 
32.243 
31.752 
33.530 
34.480 
35.540 
35. 008 



PCT7US99/06899 

^oW^4JWW 6 



i 
i 
i 
i 

i, 
i, 



00 
00 
00 
00 
00 
00 



1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1 .00 
1.00 
1.00 
1.00 
1.00 
1 .00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1. 00 
1. 00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1 .00 
1 .00 
1 .00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 
1.00 



59.27 
68.61 
70. 60 
70.71 
71.45 
72.36 
40.00 
40.00 
40.00 
40.00 
71.46 
73.02 
71.12 
70.83 
71.05 
72.83 
71.91 
72.06 
73.12 
72.88 
74.22 
72.98 
74.77 
75.94 
'76. 67 
76.91 
78 .24 
81.33 
84.46 
83.51 
77.42 
77.65 
76.29 
75.03 
74.62 
40.00 
40.00 
40.00 
40.00 
73.49 
70.24 
70.80 . 
68 .82 
71.91 
76.62 
77.95 
76.46 
66.22 
66.01 
61.96 
58.44 
61.34 
63.02 



6 
6 
8 
7 
6 
6 
6 
6 
8 
8 
6 
8 
7 
6 
6 

16 
6 
8 
7 
6 
6 
6 
6 
6 
8 
7 
6 
6 
8 
6 
6 
8 
7 
6 
6 
6 
6 

8 

8 

6 

8 

7 

6 

6 

6 

6 

6 

6 

8 

7 

6 

6 

6 



BNSDOCID: <WO_ 



_ 99600 14A3_IA> 



88 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



ATOM 


1477 


CD1 


PHE A 


A C 1 

451 


ATOM 


1478 


CD2 


PHE A 


a c i 

4bl 


ATOM 


1479 


CE1 


PHE A 


/I CI 


ATOM 


1480 


CE2 


PHE A 


/ CI 

4 Dl 


ATOM 


1481 


CZ 


PHE A 


yi C 1 

43 1 


ATOM 


1482 


C 


PHE A 


43 1 


ATOM 


1483 


O 


PHE A 


A C *1 

451 


ATOM 


1484 


N 


PRO A 


A C O 

452 


ATOM 


1485 


CD 


PRO A 


A C O 

452 


ATOM 


1486 


CA 


PRO A 


452 


ATOM 


1487 


CB 


PRO A 


452 


ATOM 


1488 


CG 


PRO A 


452 


ATOM 


1489 


C 


PRO A 


452 


ATOM 


1490 


O 


PRO A 


452 


ATOM 


1491 


N 


PRO A 


453 


ATOM 


1492 


CD 


PRO A 


453 


ATOM 


1493 


CA 


PRO A 


453 


ATOM 


1494 


CB 


PRO A 


453 


ATOM 


1495 


CG 


PRO A 


453 


ATOM 


1496 


C 


PRO A 


453 


ATOM 


1497 


O 


PRO A 


453 


ATOM 


1498 


N 


LEU A 


454 


ATOM 


1499 


CA 


LEU A 


454 


ATOM 


1500 


CB 


LEU A 


454 


ATOM 


1501 


CG 


LEU A 


454 


ATOM 


1502 


CD1 


LEU A 


454 


ATOM 


1503 


CD2 


LEU A 


454 


ATOM 


1504 


C 


LEU A 


454 


ATOM 


1505 


o 


LEU A 


454 



ATOM 1506 N PHE A 455 

35 ATOM 1507 CA PHE A 455 

ATOM 1508 CB PHE A 455 

ATOM 1509 CG PHE A 455 

ATOM 1510 CD1 PHE A 455 

ATOM 1511 CD2 PHE A 455 

40 ATOM 1512 CE1 PHE A 455 

ATOM 1513 CE2 PHE- A 455 

ATOM 1514 CZ PHE A 455 

ATOM 1515 C PHE A 455 

ATOM 1516 O PHE A 455 

45 ATOM 1517 N LEU A 45 6 

ATOM 1518 CA LEU A 45 6 

ATOM 1519 CB LEU A 456 

ATOM 1520 CG LEU A 456 

ATOM 1521 CD1 LEU A 456 

50 ATOM 1522 CD2 LEU A 456 

ATOM 1523 C LEU A 456 

ATOM 1524 O LEU A 456 

ATOM 1525 N GLU A 457 

ATOM 152 6 CA GLU A 457 

55 ATOM 1527 C GLU A 457 

ATOM 1528 O GLU A 457 

ATOM 1529 CB GLU A 457 

ATOM 1530 CG GLU A 457 



9.426 47.507 33.963 1.00 62.92 6 

11.089 47.017 35.593 1.00 63.07 6 

10.199 48.598 33.521 1-00 65.12 6 

11.860 48.102 35.156 1.00 64.66 6 

11.410 48.897 34.118 1.00 67.12 6 

8.259 43.264 35.260 1.00 56.41 6 

7.641 43.392 36.331 1.00 56.56 8 

8.555 42.045 34.755 1.00 53.28 7 

9.177 41.689 33.481 1.00 50.46 6 
8.153 40.859 35.543 1.00 50.26 6 
8.739 39.680 34.780 1.00 49.19 6 

9.178 40.206 33.482 1.00 45.89 6 
8.770 40.999 36.935 1.00 49.62 6 
9.867 41.529 37.094 1.00 52.35 8 
8.139 40.425 37.947 1.00 51.50 7 



7.001 


39. 542 


37 . 


797 


1 . 


00 


49 . 


66 


6 


8.610 


40.528 


39 . 


323 


1 . 


00 


50 . 


89 


6 


7.675 


39. 659 


40 . 


109 


1 . 


00 


51 . 


A C\ 

49 


6 


6.703 
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5 ATOM 263 CA GLU B 295 -34.839 92.159 27.816 .1.00 52.53 6 

ATOM 264 CB GLU B 295 -34.553 91.412 29.131 1.00 57.40 6 

ATOM 265 CG GLU B 295 -35.811 90.978 29.874 1.00 69.63 6 

ATOM 266 CD GLU B 295 -36.610 92.144 30.375 1.00 78.49 6 

ATOM 267 OE1 GLU B 295 -36.153 92.869 31.297 1.00 82.82 8 

10 ATOM 268 OE2 GLU B 295 -37.730 92.385 29.860 1.00 85.30 8 

ATOM 269 C GLU B 295 -33.629 93.009 27.415 1.00 48.54 6 

ATOM 270 O GLU B 295 -32.981 93.627 28.260 1.00 49.82 8 

ATOM 271 N LEU B 296 -33.374 93.030 26.109 1.00 43.79 7 

ATOM 272 CA LEU B 296 -32.268 93.761 25.540 1.00 45.42 6 

15 ATOM 273 CB LEU B 296 -31.319 92.769 24.838 1.00 41.04 6 

ATOM 274 CG LEU B 296 -30.735 91.631 25.662 1.00 42.74 6 

ATOM 275 CD1 LEU B 296 -30.354 90.478 24.764 1.00 40.99 6 

ATOM 276 CD2 LEU B 296 -29.559 92.119 26.468 1.00 39.44 6 

ATOM 277 C LEU B 296 -32.760 94.779 24.522 1.00 45.56 6 

20 ATOM 278 O LEU B 296 -33.845 94.600 23.924 1.00 43.07 8 

ATOM 279 N PRO B 297 -32.004 95.875 24.338 1.00 46.99 7 

ATOM 280 CD PRO B 297 -30.740 96.123 25.046 1.00 47.12 6 

ATOM 281 CA PRO B 297 -32.388 96.912 23.363 1.00 49.61 6 

ATOM 282 CB PRO B 297 -31.294 97.973 23.494 1.00 49.91 6 

25 ATOM 283 CG PRO B 297 -30.302 97.477 24.545 1.00 51.28 6 

ATOM 284 C PRO B 297 -32.263 96.273 21.913 1.00 49.59 6 

ATOM 285 O PRO B 297 -31.441 95.340 21.685 1.00 51.66 8 

ATOM 286 N CYS B 298 -33.035 96.667 20.854 1.00 51.02 7 

ATOM 287 CA CYS B 298 -32.761 96.150 19.456 1.00 52.86 6 

30 ATOM 288 CB CYS B 298 -33.140 97.165 18.356 1.00 54.57 6 

ATOM 289 SG CYS B 298 -34.884 97.085 17.836 1.00 67.87 16 

ATOM 290 C CYS B 298 -31.385 96.330 19.127 1.00 48.51 6 

ATOM 291 O CYS B 298 -30.579 95.506 18.744 1.00 49.58 8 

ATOM 292 N GLU B 299 -31.107 97.447 19.230 1.00 44.17 7 

35 ATOM 293 CA GLU B 299 -29.989 97.645 18.718 1.00 47.57 6 

ATOM 294 CB GLU B 299 -29.402 98.973 19.208 1.00 49.92 6 

ATOM 295 CG GLU B 299 -29.944 100.187 18.433 1.00 59.30 6 

ATOM 296 CD GLU B 299 -31.090 100.887 19.164 1.00 63.80 6 

ATOM 297 OE1 GLU B 299 -31.673 101.904 18.629 1.00 69.03 8 

40 ATOM 298 OE2 GLU B 299 -31.473 100.458 20.319 1.00 67.10 8 

ATOM 299 C GLU B 299 -28.993 96.533 18.987 1.00 46.57 6 

ATOM 300 O GLU B 299 -28.200 96.179 18.111 1.00 44.65 8 

ATOM 301 N ASP B 300 -29.045 95.989 20.203 1.00 45.17 7 

ATOM 302 CA ASP B 300 -28.152 94.908 20.584 1.00 43.32 6 

45 ATOM 303 CB ASP B 300 -27.985 94.849 22.105 1.00 37.38 6 

ATOM 304 CG ASP B 300 -27.239 96.016 22.650 1.00 36.23 6 

ATOM 305 OD1 ASP B 300 -26.208 96.421 22.052 1.00 35.87 8 

ATOM 306 OD2 ASP B 300 -27.661 96.543 23.716 1.00 40.14 8 

ATOM 307 C ASP B 300 -28.721 93.591 20.071 1.00 42.81 6 

50 ATOM 308 O ASP B 300 -28.001 92.775 19.489 1.00 46.02 8 

ATOM 309 N GLN B 301 -30.019 93.399 20.306 1.00 38.60 7 

ATOM 310 CA GLN B 301 -30.712 92.197 19.858 1.00 40.00 6 

ATOM 311 CB GLN B 301 -32.234 92.418 19.836 1.00 38.59 6 

ATOM 312 CG GLN B 301 -32.908 92.380 21.187 1.00 40.26 6 

55 ATOM 313 CD GLN B 301 -34.401 92.583 21.083 1.00 44.15 6 

ATOM 314 OE1 GLN B 301 -34.859 93.637 20.589 1.00 45.73 8 

ATOM 315 NE2 GLN B 301 -35.165 91.602 21.544 1.00-46.13 7 

ATOM 316 C GLN B 301 -30.237 91.830 18.455 1.00 41.64 6 
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5 ATOM 1181 CA VAL B 414 -16.845 60.674 27.788 1.00 51.06 6 

ATOM 1182 CB VAL B 414 -17.317 61.498 26.586 1.00 51.49 6 

ATOM 1183 CGI VAL B 414 -16.133 62.122 25.891 1.00 45.22 6 

ATOM 1184 CG2 VAL B 414 -18.095 60.631 25.617. 1.00 52.67 6 

ATOM 1185 C VAL B 414 -16.096 61.557 28.775 1.00 54.28 6 

10 ATOM 1186 O VAL B 414 -16.700 62.283 29.574 1.00 55.49 8 

ATOM 1187 N THR B 415 -14.770 61.466 28.742 1.00 56.28 7 

ATOM 1188 CA THR B 415 -13.919 62.234 29.669 1.00 57.83 6 

ATOM 1189 CB THR B 415 -12.488 61.686 29.658 1.00 59.64 6 

ATOM 1190 OG1 THR B 415 -11.618 62.572 30.373 1.00 66.69 8 

15 ATOM 1191 CG2 THR B 415 -11.988 61.483 28.227 1.00 59.42 6 

ATOM 1192 C THR B 415 -13.840 63.726 29.352 1.00 56.98 6 

ATOM 1193 O THR B 415 -13.987 64.135 28.216 1.00 55.70 8 

ATOM 1194 N ' HIS B 416 -13.598 64.522 30.387 1.00 57.44 7 

ATOM 1195 CA HIS B 416 -13.485 65.972 30.237 1.00 57.34 6 

20 ATOM 1196 CB HIS B 416 -12.114 66.326 29.653 1.00 61.35 6 

ATOM 1197 CG HIS B 416 -10.968 65.931 30.513 1.00 69.78 6 

ATOM 1198 CD2 HIS B 416 -9.930 65.082 30.307 1.00 71.42 6 

ATOM 1199 ND1 HIS B 416 -10.756 66.480 31.787 1.00 72.49 7 

ATOM 1200 CE1 HIS B 416 -9.631 65.973 32.281 1.00 75.50 6 

25 ATOM 1201 NE2 HIS B 416 -9.120 65.131 31.408 1.00 73.91 7 

ATOM 1202 C HIS B 416 -14.560 66.515 29.320 1.00 53.79 6 

ATOM 1203 0 HIS B 416 -14.334 67.477 28.591 1.00 52.81 8 

ATOM 1204 N PHE B 417 -15.746 65.921 29.372 1.00 48.05 7 

ATOM 1205 CA PHE B 417 -16.841 66.329 28.505 1.00 47.99 6 

30 ATOM 1206 CB PHE B 417 -18.152 65.694 28.937 1.00 46.11 6 

ATOM 1207 CG PHE B 417 -19.233 65.781 27.898 1.00 44.27 6 

ATOM 1208 CD1 PHE B 417 -19.280 64.856 26.870 1.00 41.79 6 

ATOM 1209 CD2 PHE B 417 -20.118 66.846 27.893 1.00 40.23 6 

ATOM 1210 CE1 PHE B 417 -20.233 64.959 25.869 1.00 44.30 6 

35 ATOM 1211 CE2 PHE B 417 -21.072 66.955 26.893 1.00 36.80 6 

ATOM 1212 CZ PHE B 417 -21.119 66.016 25.866 1.00 40.69 6 

ATOM 1213 C PHE B 417 -17.020 67.833 28.423 1.00 46.69 6 

ATOM 1214 O PHE B417 -16.799 68.423 27.380 1.00 43.35 8 

ATOM 1215 N TRP B 418 -17.448 68.452 29.516 1.00 45.14 7 

40 ATOM 1216 CA TRP B 418 -17.681 69.889 29.508 1.00 44.89 6 

ATOM 1217 CB TRP B 418 -18.045 70.398 30.898 1.00 42.24 6 

ATOM 1218 CG TRP B 418 -18.162 71.905 31.018 1.00 47.11 6 

ATOM 1219 CD2 TRP B 418 -19.298 72.699 30.620 1.00 46.98 6 

ATOM 1220 CE2 TRP B 418 -18.953 74.061 30.850 1.00 48.94 6 

45 ATOM 1221 CE3 TRP B 418 -20.560 72.401 30.086 1.00 45.23 6 

ATOM 1222 CD1 TRP B 418 -17.223 72.778 31.462 1.00 46.24 6 

ATOM 1223 NE1 TRP B 418 -17.690 74.071 31.368 1.00 50.63 7 

ATOM 1224 CZ2 TRP B 418 -19.819 75.109 30.571 1.00 45.46 6 

ATOM 1225 CZ3 TRP B 418 -21.422 73.447 29.809 1.00 44.50 6 

50 ATOM 1226 CH2 TRP B 418 -21.065 74.777 30.039 1.00 47.55 6 

ATOM 1227 C TRP B 418 -16.502 70.662 28.956 1.00 43.88 6 

ATOM 1228 O TRP B 418 -16.671 71.424 27.986 1.00 43.17 8 

ATOM 1229 N PRO B 419 -15.292 70.490 29.519 1.00 43.55 7 

ATOM 1230 CD PRO B 419 -14.967 69.551 30.599 1.00 41.52 6 

55 ATOM 1231 CA PRO B 419 -14.120 71.223 29.011 1.00 41.48 6 

ATOM 1232 CB PRO B 419 -12.956 70.582 29.724 1.00 39.21 6 

ATOM 1233 CG PRO B 419 -13.521 69.703 30.774 1.00 39.25 6 

ATOM 1234 C PRO B 419 -14.035 71.067 27.479 1.00 36.28 6 
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ATOM 1318 O ARG B 429 

35 ATOM 1319 N MET B 430 

ATOM 1320 CA MET B 430 





ATOM 


1321 


CB 


MET 


B 


430 




ATOM 


1322 


CG 


MET 


B 


430 




ATOM 


1323 


SD 


MET 


B 


430 


40 


ATOM 


1324 


CE 


MET 


B 


430 




ATOM 


1325 


C 


MET 


B 


430 




ATOM 


1326 


O 


MET 


B 


430 




ATOM 


1327 


N 


ILE 


B 


431 




ATOM 


1328 


CA 


ILE 


B 


431 


45 


ATOM 


1329 


CB 


ILE 


B 


431 




ATOM 


1330 


CG2 


ILE 


B 


431 




ATOM 


1331 


CGI 


ILE 


B 


431 




ATOM 


1332 


CD1 


ILE 


B 


431 




ATOM 


1333 


C 


ILE 


B 


431 


50 


ATOM 


1334 


o 


ILE 


B 


431 




ATOM 


1335 


N 


GLY 


B 


432 




ATOM 


1336 


CA 


GLY 


B 


432 




ATOM 


1337 


C 


GLY 


B 


432 




ATOM 


1338 


O 


GLY 


B 


432 


55 


ATOM 


1339 


N 


ALA 


B 


433 




ATOM 


1340 


CA 


ALA 


B 


433 




ATOM 


1341 


CB 


ALA 


B 


433 




ATOM 


1342 


C 


ALA 


B 


433 



-12.153 78.174 19.782 1.00 25.40 6 
-14.677 77.742 18.706 1.00 32.53 6 
-14.639 78.530 17.763 1.00 35.19 8 
-14.749 76.425 18.566 1.00 28.83 7 
-14.796 75.807 17.257 1.00 35.12 6 
-15.096 74.302 17.380 1.00 39.14 6 
-13.910 73.496 17.806 1.00 45.80 6 
-12.786 73.774 17.348 1.00 41.97 8 
-14.064 72.517 18.583 1.00 50.06 8 
-15.883 76.502 16.429 1.00 33.94 6 
-15.673 76.815 15.262 1.00 38.02 8 
-17.040 76.741 17.048 1.00 27.15 7 
-18.154 77.388 16.367 1.00 29.99 6 
-19.448 77.190 17.168 1.00 22.49 6 
-20.086 75.818 17.089 1.00 25.54 6 
-21.282 75.729 18.012 1.00 20.60 6 
-20.509 75.564 15.651 1.00 17.24 6 
-17.901 78.863 16.103 1.00 28.94 6 
-18.328 79.388 15.076 1.00 31.26 8 
-17.213 79.524 17.035 1.00 27.64 7 
-16.894 80.937 16.883 1.00 28.13 6 
-16.274 81.507 18.160 1.00 29.59 6 
-17.246 81.752 19.302 1.00 34.85 6 
-16.626 82.653 20.372 1.00 47.18 6 
-17.373 82.714 21.620 1.00 57.93 7 
-18.632 83.124 21.716 1.00 63.62 6 
-19.263 83.579 20.622 1.00 60.71 7 
-19.238 83.130 22.916 1.00 62.38 7 
-15.930 81.146 15.728 1.00 29.81 6 
-16.101 82.061 14.933 1.00 30.81 8 
-14.908 80.295 15.670 1.00 29.64 7 
-13.920 80.343 14.614 1.00 34.72 6 
-12.939 79.192 14.763 1.00 34.97 6 
-11.787 79.431 15.689 1.00 45.34 6 
-10.729 80.768 15.158 1.00 52.55 16 
-10.070 80.157 13.610 1.00 55.56 6 
-14.638 80.217 13.284 1.00 34.01 6 
-14.395 80.996 12.385 1.00 37.29 8 
-15.516 79.217 13.176 1.00 29.99 7 
-16.296 78.992 11.963 1.00 28.82 6 
-17.391 77.929 12.177 1.00 27.39 6 
-18.314 77.841 10.959 1.00 23.87 6 
-16.784 76.555 12.449 1.00 25.56 6 
-17.826 75.464 12.498 1.00 17.29 6 
-16.953 80.288 11.538 1.00 29.49 6 
-16.837 80.725 10.398 1.00 24.19 8 
-17.657 80.904 12.474 1.00 25.25 7 
-18.357 82.142 12.179 1.00 30.38 6 
-17.395 83.209 11.725 1.00 32.75 6 
-17.531 83.740 10.637 1.00 36.38 8 
-16.431 83.522 12.586 1.00 26.77 7 
-15.407 84.514 12.299 1.00 26.48 6 
-14.240 84.338 13.253 1.00 19.90 6 
-14.905 84.433 10.867 1.00 30.73 6 
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1349 
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1351 

1352 

1353 
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1358 
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ALA B 
CYS B 
CYS B 
CYS B 
CYS B 
CYS B 
CYS B 
HIS B 
HIS B 
HIS B 
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HIS B 
HIS B 
HIS B 
HIS B 
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ALA B 
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SER B 
SER B 
SER B 
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SER B 
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ARG B 
ARG B 
ARG B 
ARG B 
ARG B 
ARG B 
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PHE B 
PHE B 
PHE B 
PHE B 
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434 
434 
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-14 .849 
-14.534 
-14.023 
-13.553 
-12.412 
-15. 106 
-14.844 
-16.318 
-17.395 
-18.700 
-19.845 
-20.483 
■ -20. 345 
-21.241 
-21.341 
-17. 528 
-17.842 
-17.315 
-17.376 
-17. 008 
-16. 393 
-16.734 
-15.162 
-14.122 
-12. 882 
-11 . 855 
-14. 642 
-14.700 
-15. 008 
-15. 526 
-16.019 
-14.910 
-14.044 
-14.781 
-14. 482 
-13. 458 
-15.219 
-16.659 
-16.841 
-17. 417 
-18.531 
-19.198 
-20.487 
-21.535 
-20. 638 
-22.735 
-21.851 
-22.901 
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-17.021 
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89.717 

88.911 

87.319 

87.354 

88 .117 

89.115 

89.718 

90.588 



10.171 
10.439 
9.120 
9.226 
8.444 
8.062 
6. 952 
8.394 
7.443 
8.103 
7.149 
6.515 
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5. 
5. 
6. 
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672 
605 
878 
5.715 
7.758 
7.405 
8 . 618 
6.266 
5.257 
6. 448 
5.445 
5.847 
879 
108 
127 
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908 
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356 
235 
040 
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34 .09 
34.89 
34.30 
35.44 
31.76 
32.03 
"28.61 
28.48 
33.27 
31.57 
32.74 
32.87 
31.01 
29.91 
21 .23 
33. 86 
36. 10 
35. 19 
33.03 
35.31 
44 . 99 
38.39 
37.54 
37 . 32 
39.30 
42. 97 
41.72 
45.23 
45.66 
49.71 
50. 91 
46.86 
42.37 
40. 58 
42.25 
42. 81 
42. 18 
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47. 09 
39. 76 
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46.36 
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36.97 
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ATOM 1419 N LYS B 443 

ATOM 1420 CA LYS B 443 

ATOM 1421 CB LYS B 443 

30 ATOM 1422 CG LYS B 443 

ATOM 1423 CD LYS B 443 

ATOM 1424 CE LYS B 443 

ATOM 1425 NZ LYS B 443 
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-14.219 91.W*MU&.ll&**?f<to"5ttS 6 

-16.577 91.591 2.574 1.00 35.42 6 
-16.099 88.561 -0.273 1.00 45.47 6 
-16.631 89.059 -1.265 1.00 52.48 8 
-15.238 87.549. -0.345 1.00 49.15 7 
-14.929 86.956 -1.632 1.00 54.76 6 
-14.150 85.700 -1.448 1.00 56.68 6 
-12.713 85.934 -1.230 1.00 62.73 6 
-11.602 85.418 -1.812 1.00 65.73 6 
-12.245 86.850 -0.273 1.00 66.01 7 
-10.916 86.847 -0.309 1.00 65.55 6 
-10.512 85.993 -1.228 1.00 60.09 7 
-16.217 86.633 -2.301 1.00 55.93 6 
-16.418 86.938 -3.465 1.00 57.33 8 
-17.106 85.997 -1.553 1.00 57.81 7 
-18.399 85.652 -2.106 1.00 59.11 6 
-19.340 85.162 -1.008 1-00 55.93 6 
—18.991 83.796 -0.456 1.00 58.52 6 
-20.310 82.994 0.505 1.00 60.99 16 
-20.525 84.203 1.827 1.00 52.61 6 
-18.991 86.879 -2.785 1.00 60.31 6 
-19.646 86.778 -3.817 1.00 58.18 8 
-18.731 88.045 -2.213 1.00 61.45 7 
-19.267 89.268 -2.758 1.00 64.90 6 
-19.182 90.358 -1.704 1.00 64.40 6 
-20.160 91.449 -1.982 1.00 69.12 6 
-19.763 92.673 -1.306 1.00 71.14 6 
-20.508 92.993 -0.491 1.00 73.43 6 
-20.174 94.242 0.151 1.00 67.97 7 
-18.528 89.704 -4.020 1.00 67.29 
-18.979 90.586 -4.731 1.00 67.90 
-17.383 89.075 -4.285 1.00 66.57 
-16.589 89.418 -5.455 1.00 64.76 
-15.097 89.568 -5.082 1.00 62.76 
-14.269 89.857 -6.298 1.00 64.00 
-14.905 90.678 -4.042 1.00 59.27 
-16.800 88.397 -6.569 1.00 68.61 
-16.968 88.774 -7.729 1.00 70.60 
-16.812 87.118 -6.219 1.00 70.71 
-16.951 86.033 -7.197 1.00 71.45 
-16.169 84.809 -6.712 1.00 72.36 
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19 .864 
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16 . 629 
15.051 
14 . 189 
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17.856 
18 .314 
17.529 
18.267 
16 . 125 
15 . 062 
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21. 601 
21. 721 
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22 . 634 
21 . 849 
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20.392 
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24.530 
24.564 • 
23.319 - 
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34 . 85 
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27 .752 
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27 .195 
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29 .735 
29.205 
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26 .215 
26.288 
24 . 898 
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23 .946 
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26 . 878 
28 .306 
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29 . 231 

30 . 110 
30 . 127 
28 .538 
27 .692 
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-8.153 -5.044 
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-8.654 -6.480 
-7.775 -7.415 
-7 .781 -7.414 
-7.793 -7.091 
-7.725 -8.762 
-7.038 -7.692 
-7.457 -9.855 
-6 .218 -8 .675 
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727 SD 



ILE 
ILE 
GLY 
GLY 
GLY 
GLY 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
LEU 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
VAL 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
TRP 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
ARG 
SER 
SER 
SER 
SER 
SER 
SER 
MET 
MET 
MET 
MET 
MET 



A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 
A 



A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 



389 
389 
390 
390 
390 
390 
391 
391 
391 
391 
391 
391 
391 
391 
392 
392 
392 
392 
392 
392 
392 
393 
393 
393 
393 
393 
393 
393 
393 
393 
393 
393 
393 
393 
3 93 
394 
394 
394 
394 
394 
394 
394 
394 
394 
394 
394 
395 
3 95 
395 
395 
395 
395 
396 
396 
396 
396 
396 



15.333 

15.813 

15.410 

16 . 049 

.15.243 

15.801 

13 . 920 

13 . 018 

11.561 
10 .480 
10.579 
9 .115 
13 .208 
13 .440 
13 .122 
13 .282 
13 .186 
13 .733 . 
11.739 
14.626 
14.728 
15 . 652 

16 . 999 

17 . 977 

19 . 287 

20 . 341 
21.375 
20 . 512 
19. 710 
20 . 963 
22.566 
21.698 
22 . 709 
17 . 082 
17.767 
16 .399 
16 .412 

15 .776 

16 .243 
15 .830 
14 .443 

14 . 053 
14 . 944 
12 .783 

15 . 622 
15 . 889 
14 . 638 
13 .776 

12 .395 
11.916 

14 .316 

13 . 726 

15 .437 
16.061 
17 .466 
17 .585 
19 . 192 



5 . 

6 . 
4 . 

6 . 

7 . 
6 , 
7 
6 



7 . 
6 , 
6 , 
7 
7 



322 - 
453 - 
477 - 
895 - 
021 
994 - 
,888 - 
4.887 - 
4.420 ■ 
497 ■ 
156 
868 
216 
255 
170 
357 
.993 
8 . 129 

6 .712 

8 . 014 

9 .242 

7 . 186 
670 
491 
784 
605 
612 

8 .335 
6.339 

6 . 833 

8 .323 

9 . 044 
9 . 030 
8 .414 
9.435 

7 . 897 

8 .531 
7 .633 

195 
551 
071 
912 
3 . 108 
3 . 544 

9 . 833 
10 . 729 

9 . 924 
11 . 104 
10 .696 

9 .530 
12 .240 
13 .324 
11.986 
12 . 954 
12 . 483 
11. 919 
12 . 262 



6 . 
5 
5 
3 



922 
970 
943 
183 
819 
318 
787 
343 
. 194 
.342 
. 725 
. 148 
. 620 
4.243 
2 .295 
469 
042 
897 
414 
754 
-1.832 
-1 . 924 
-2.204 
-2 . 199 
-2 . 857 
-2 .339 
-3 . 302 
-1 . 154 
-4 . 077 
-4 . 351 
-3 . 120 
-0 . 971 
-1. 950 
-3 . 547 
-3 . 650 
-4. 568 
-5. 890 
-6 . 965 
-7 . 024 
-8.352 
-8.363 
-7 . 841 
-7.267 
-7 . 907 
-5 . 879 
-6 . 677 
-4 . 988 
-4 . 902 
-4 .382 
-5 • 029 
-4.033 
-3 . 977 
-3 .368 
-2 .475 
-2 . 112 
-0 .715 
0 . 004 



1.00 
1.00 
1.00 



1. 
1 . 
1. 
1 . 
1 . 
1 . 
1. 
1 , 
1 , 
1 
1 
1 
1 
1 
1 
1 
1. 
1 . 
1 . 
1. 
1 . 
1 . 
1 . 
1. 
1 - 
1. 
1. 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 



00 
00 
00 
00 
00 
00 
00 
00 
. 00 
.00 
.00 
.00 
.00 
.00 
.00 
. 00 
. 00 
. 00 
.00 
.00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
, 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
1. 00 
1. 00 
1. 00 



, 00 
. 00 
. 00 
. 00 
.00 



1.00 
1.00 



, 00 
. 00 
.00 
. 00 
. 00 
.00 



18 . 67 

19 .75 

20 . 58 
19 .33 

17 .48 
21.87 
19 . 17 

21 . 50 

18 . 25 
21.98 
21 .39 
17 . 15 
23 .27 
23 . 60 

23 .04 

24 . 42 
27 .38 
30 .37 
23 .48 
27 . 55 

27 . 50 

23 . 65 

24 .76 
22 .86 

25 . 90 

28 . 09 

29 . 94 

30 . 20 
26.55 
30 . 64 
32 .43 
34.58 
36 . 54 
25 . 02 
20 . 97 

23 . 06 
25 . 97 

24 . 05 
26.05 

22 .70 
20 .71 
21.26 
20 . 09 
21.26 

23 .40 
28 .61 

26 . 65 

27 .46 
26 .70 
22 . 95 
31.45 

28 . 11 
33.83 
38.83 
39 .47 
41.37 
42 .20 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 

ATOM 



728 

729 

730 

731 

732 

733 

734 

735 

736 

737 

738 

739 

740 

741 

742 

743 

744 

745 

746 

747 

748 

749 

750 

751 

752 

753 

754 

755 

756 

757 

758 

759 

760 

761 

762 

763 

764 

765 

766 

767 

768 

769 

770 

771 

772 

773 

774 

775 

776 

777 

778 

779 

780 

781 

782 

783 

784 



CE 

G 

O 

N 

CA 

CB 

CG 

CD 

OE1 

OE2 

C 

O 

N 

CA 

CB 

CG 

CD 2 

ND1 

CE1 

NE2 

C 

o 

N 

CD 

CA 

CB 

CG 

C 

O 

N 

CA 

C 

O 

N 

CA 

CB 

CG 

CD 

CE 

N2 

C 

O 

N 

CA 

CB 

CG 

CD1 

CD 2 

C 

O 

N 

CA 

CB 

CG 

CD1 

CD 2 

C 



MET 

MET 

MET 

GLU 

GLU 

GLU 

GLU 

GLU 

GLU 

GLU 

GLU 

GLU 

HIS 

HIS 

HIS 

HIS 

HIS 

HIS 

HIS 

HIS 

HIS 

HIS 

PRO 

PRO 

PRO 

PRO 

PRO 

PRO 

PRO 

GLY 

GLY 

GLY 

GLY 

LYS 

LYS 

LYS 

LYS 

LYS 

LYS 

LYS 

LYS 

LYS 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 

LEU 



A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 

A 



396 

396 

396 

397 

397 

397 

397 

397 

397 

397 

397 

397 

398 

398 

398 

398 

398 

398 

398 

398 

398 

398 

399 

399 

399 

399 

399 

399 

399 

400 

400 

400 

400 

401 

401 

401 

401 

401 

401 

401 

401 

401 

402 

402 

402 

402 

402 

402 

402 

402 

403 

403 

403 

403 

403 

403 

403 

144. 



. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
.00 
, 00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 



20.263 11.996 -1.404 1 
16.143 14.376 -3.018 1 

15.637 ' 15.316 -2.403 1 
16.794 14.526 -4.166 1 
16.971 15.831 -4.790 1 
18.184 15.785 -5.729 1 

17.883 15.189 -7.096 1 

19.117 14.665 -7.810 1 

19.219 13.430 -7.990 1 

19.980 15.485 -8.196 1 

15.735 16.322 -5.554 1 

15.830 17.229 -6.376 1, 

14.579 15.728-5.280 1, 

13.342 16.118 -5.950 1 

12.924 15.043 -6.956 1* 

13.870 14.886 -8.104 1 

13.904 15.484 -9.318 1. 

14.940 14.017 -8.074 1. 

15.592 14.086 -9.220 1. 

14.985 14.969 -9.993 1. 

12.216 16.332 -4.944 1. 

11.282 15.535 -4.864 l! 

12.283 17.427 -4.171 1. 
13.328 18.467 -4 . 198 
11.243 17.709 -3 .173 
11.603 19.101 -2.654 
13.050 19.267 -2.963 

9.828 17.663 -3 .744 1 

9 . 554 18 . 249 -4 . 789 1 

8.938 16.954 -3 .057 1 

7.559 16.865 -3 .503 1 

7.230 15.706 -4.428 1 

6 . 063 15 .344 -4 . 574 1 

8.237 15.112 -5.055 1, 

7.972 14.007 -5.966 l] 

8.235 14.430 -7.415 1, 

8.130 15.927 -7.675 1. 

9.096 16.353 -8.774 1 

8.733 17.721 -9.331 1." 

7.295 18.027 -9.116 1. 

8.768 12.746 -5.677 1." 

9.809 12.776 -5.006 l.'oo 

8.256 11.635 -6.197 

8.889 10.334 -6.050 

7.866 9.294 -5.590 

7.265 9.555 -4.207 

6.126 8 ,583 -3 . 937 

8.355 9.416 -3 . 157 

9.448 9.948 -7.414 

8.704 9.836 -8 .389 

10.761 9.770 -7.487 

11.393 9.400 -8 .744 

12.825 9.937 -8.816 
13.401 10.027-10.238 
14.519 11.046-10 .288 

13.915 8.665-10.676 

11.419 7.891 -8 .901 



1. 00 
1.00 
1 .00 
1. 00 
. 00 
. 00 
. 00 
. 00 
, 00 
00 
00 
00 
00 
00 
00 
00 
00 
00 



1 

1 

1 

1 

1 

1 

1 

1 . 

1 , 

1. 

1 . 

1. 

1 . 

1 . 

1 . 



. 00 
. 00 
. 00 
. 00 
. 00 
, 00 
00 
00 
00 
00 
00 
00 
00 
00 
00 



42 . 84 
40 . 69 

38 .85 
42.19 
44 . 80 
46 . 02 
54 .42 

59 .40 

60 . 63 
62.71 
42 . 94 
44 . 68 
40 . 82 
39.21 
39.05 
41.57 

39 . 28 
41. 85 

40 . 88 
42 .30 
37 . 04 
36.51 
39 . 19 
35.36 
37 . 10 

-37.86 
35 . 83 

37 . 02 

38 . 52 
33 .58 
32.12 
32.43 

33 .21 
31.35 
30 . 75 

35 .43 
35.15 

36 . 88 
36.71 

34 . 22 
30 . 97 
27.60 
27.28 
29.07 
22 .55 
24 . 94 
19.32 
21.54 
28.78 
29 . 98 
27.57 
27.17 
26.95 

,30.42 
30.76 
33.11 
24 . 78 



BNSDOCIO: <WO 996001 4A3 J A> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



5 ATOM 785 O LEU A 403 

' ATOM 786 N PHE A 404 

ATOM ' 787 CA PHE A 404 

ATOM 788 CB PHE A 404 

ATOM 78 9 CG PHE A 404 

10 ATOM 790 CD1 PHE A 404 





ATOM 


791 ' 


CD 2 


PHE A 


404 




ATOM 


792 


CE1 


PHE A 


404 




ATOM 


793 


CE2 


PHE A 


404 




ATOM 


794 


CZ 


PHE A 


404 


X <J 


ATOM 


795 


C 


PHE A 


404 




ATOM 


796 


O 


PHE A 


404 




ATOM 


797 


N 


ALA A 


405 




ATOM 


798 


CA 


ALA A 


405 




ATOM 


799 


CB 


ALA A 


405 


90 


ATOM 


800 


c 


ALA A 


405 




ATOM 


801 


O 


ALA A 


405 




ATOM 


802 


N 


PRO A 


406 




ATOM 


803 


CD 


PRO A 


406 




ATOM 


804 


CA 


PRO A 


406 


Zj 


ATOM 
±±X KJVl 


805 


CB 


PRO A 


406 




A TOM 


806 


CG 


PRO A 


406 




ATOM 


807 


c 


PRO A 


406 




ATOM 


808 


o 


PRO A 


406 




ATOM 


809 


N 


ASN A 


407 




ATOM 


810 


CA 


ASN A 


407 




7ATOM 
J-i. X yJrl 


811 


CB 


ASN A 


407 




ATOM 


812 


CG 


ASN A 


407 




ATOM 

Jr\. X vl 1 


813 


OD1 


ASN A 


407 




ATOM 


814 


ND2 


ASN A 


407 




ATOM 


815 


C 


ASN A 


407 




ATOM 


816 


O 


ASN A 


407 




ATOM 


817 


N 


LEU A 


408 




ATOM 


818 


CA 


LEU A 


408 




ATOM 


819 


CB 


LEU A 


408 


40 


ATOM 


820 


CG 


LEU A 


408 




ATOM 


821 


CD1 


LEU A 


408 




ATOM 


822 


CD 2 


LEU A 


408 




ATOM 


823 


C 


LEU A 


408 




ATOM 


824 


O 


LEU A 


408 


45 


ATOM 


825 


N 


LEU A 


409 




ATOM 


826 


CA 


LEU A 


409 




ATOM 


827 


CB 


LEU A 


409 




ATOM 


828 


CG 


LEU A 


409 




ATOM 


829 


CD1 


LEU A 


409 


50 


ATOM 


830 


CD 2 


LEU A 


409 




ATOM 


831 


C 


LEU A 


409 




ATOM 


832 


O 


LEU A 


409 




ATOM 


833 


N 


LEU A 


410 




ATOM 


834 


CA 


LEU A 


410 


55 


ATOM 


835 


CB 


LEU A 


410 




ATOM 


836 


CG 


LEU A 


410 




ATOM 


837 


CD1 LEU A 


410 




ATOM 


838 


CD 2 LEU A 


410 




ATOM 


839 


C 


LEU A 


410 


60 ATOM 


840 


i O 


LEU A 


410 




ATOM 


841 N 


ASP A 


411 



12.428 7.257 -8.619 1.00 24.68 

10.306 7.319 -9.344 1.00 23.11 

10.239 5.881 -9.546 1.00 26.93 

8.826 5.470 -9.946 1.00 27.04 

7.850 5.513 -8.816 1.00 27.89 

7.028 6.623 -8.631 1.00 26.20 

7.750 4.444 -7.925 1.00 23.10 

6.116 6.668 -7.573 1.00 25.29 

6.845 4.481 -6.870 1.00 21.01 

6.026 5.595 -6.693 1.00 22.91 

11.232 5.507-10.637 1.00 26.04 

11.882 4.464-10.578 1.00 27.27 

11.348 6.383-11.626 1.00 28.80 

12.271 6.195-12.740 1.00 29.21 

11.650 5.287-13.806 1.00 26.89 

12.549 7.578-13.317 1.00 30.23 

11.770 8.508-13.109 1.00 27.38 

13.672 7.737-14.032 1.00 30.05 

14.712 6.745-14.352 1.00 26.31 

13.977 9.053-14.604 1.00 32.10 

15.232 8.800-15.438 1.00 31.28 

15.865 7.602-14.776 1.00 31.44 

,12.820 9.589-15.436 1.00 32.58 

12.605 10.796-15.507 1.00 32.58 

12.063 8.690-16.053 1.00 32.86 

10.935 9.119-16.865 1-00 32.78 

10.950 8.418-18.228 1.00 34.73 

10.884 6.907-18.121 1.00 35.37 

11.189 6.317-17.077 1.00 30.24 

10.486 6.268-19.215 1.00 34.08 

9.605 8.901-16.166 1.00 34.90 

8.549 8.897-16.798 1.00 36.09 

9.660 8.724-14.851 1.00 33.56 

8.452 8.544-14.061 1.00 35.59 

8.141 7.062-13.851 1.00 33.81 

6.696 6.823-13.397 1.00 36.44 

5.746 7.479-14.390 1.00 34.14 

6.406 5.334-13.287 1.00 32.96 

8.607 9.245-12.717 1.00 38.03 

8.880 8.614-11.695 1.00 36.38 

8.441 10.563-12.741 1.00 37.87 

8.548 11.395-11.553 1.00 37.95 

9.373 12.636-11.877 1.00 39.52 

10.023 13.399-10.728 1.00 42.46 

11.100 12.547-10.082 1.00 43.24 

10.614 14.691-11.266 1.00 46.05 

7.132 11.792-11.163 1.00 37.13 

6.482 12.546-11.882 1.00 35.70 

6.654 11.284-10.030 1.00 35.29 

5.297 11.576 -9.583 1.00 33.33 

4.503 10.277 -9.449 1.00 29.37 

4.645 9.238-10.560 1.00 32.75 

4.026 7.925-10.104 1.00 29.16 

3.958 9.744-11.819 1.00 30.70 

5.207 12.332 -8.261 1.00 3 5.14 

6.078 12.214 -7.400 1.00 36.94 

4.141 13.108 -8.105 1.00 34.76 
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SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO_ 



_996001 4A3_IA> 



WO 99/060014 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATOM 


842 


CA 


ASP 


A 


411 


3 


.93^ 


ATOM 


843 


CB 


ASP 


A 


411 


3 


. 733 


ATOM 


844 


CG 


ASP 


A 


411 


2 


.471 


ATOM 


845 


OD1 


ASP 


A 


411 


1 


.570 


ATOM 


846 


OD2 


ASP 


A 


411 


2 


.383 


ATOM 


847 


C 


ASP 


A 


411 


2 


.727 


ATOM 


848 


O 


ASP 


A 


411 


2 


.033 


ATOM 


849 


N 


ARG 


A 


412 


2 


. 480 


ATOM 


850 


CA 


ARG 


A 


412 


1 


.375 


ATOM 


851 


CB 


ARG 


A 


412 


1 


.260 


ATOM 


852 


CG 


ARG 


A 


412 


0 


.562 


ATOM 


853 


CD 


ARG 


A 


412 


0 


.454 


ATOM 


854 


NE 


ARG 


A 


412 


-0 


.261 


ATOM 


855 


CZ 


ARG 


A 


412 


-1 


.574 


ATOM 


856 


NH1 


ARG 


A 


412 


-2 


.316 


ATOM 


857 


NH2 


ARG 


A 


412 


-2 


. 150 


ATOM 


858 


C 


ARG 


A 


412 


0 


. 034 


ATOM 


859 


O 


ARG 


A 


412 


-0 


.775 


ATOM 


860 


N 


ASN 


A 


413 


-0 


. 198 


ATOM 


861 


CA 


ASN 


A 


413 


-1 


.458 


ATOM 


862 


CB 


ASN 


A 


413 


-1 


.518 


ATOM 


863 


CG 


ASN 


A 


413 


-1 


.739 


ATOM 


864 


OD1 


ASN 


A 


413 


-2 


.376 


ATOM 


865 


ND2 


ASN 


A 


413 


-1 


.213 


ATOM 


866 


C 


ASN 


A 


413 


-1 


. 673 


ATOM 


867 


O 


ASN 


A 


413 


-2 


. 792 


ATOM 


868 


N 


GLN 


A 


414 


-0 


. 600 


ATOM 


869 


CA 


GLN 


A 


414 


-0 


. 703 


ATOM 


870 


CB 


GLtN 


A 


414 


0 


.585 


ATOM 


871 


CG 


GLiN 


A 


414 


0 


.572 


ATOM 


872 


CD 


GLN 


A 


414 


1 


. 914 


ATOM 


873 


OE1 


GliN 


A 


414 


2 


.591 


ATOM 


874 


NE2 


GLN 


A 


414 


2 


.309 


ATOM 


875 


C 


GLN 


A 


414 


-0 


.970 


ATOM 


876 


O 


GLN 


A 


414 


-1 


.491 


ATOM 


877 


N 


GLY 


A 


415 


-0 


.618 


ATOM 


878 


CA 


GLY 


A 


415 


-0 


.836 


ATOM 


879 


C 


GLY 


A 


415 


-2 


.306 


ATOM 


880 


O 


GLY 


A 


415 


-2 


696 


ATOM 


881 


N 


LYS 


A 


416 


-3 . 


129 


ATOM 


882 


CA 


LYS 


A 


416 


-4. 


566 


ATOM 


883 


CB 


LYS 


A 


416 


-5. 


212 


ATOM 


884 


CG 


LYS 


A 


416 


-4 . 


761 


ATOM 


885 


CD 


LYS 


A 


416 


-4 . 


910 


ATOM 


886 


CE 


LYS 


A 


416 


-5. 


992 


ATOM 


887 


NZ 


LYS 


A 


416 


-5. 


416 


ATOM 


888 


C 


LYS 


A 


416 


-5. 


227 


ATOM 


889 


O 


LYS 


A 


416 


-6 . 


339 


ATOM 


890 


N 


CYS 


A 


417 


-4. 


540 


ATOM 


891 


CA 


CYS 


A 


417 


-5. 


066 


ATOM 


892 


CB 


CYS 


A 


417 


-4 . 


062 


ATOM 


893 


SG 


CYS 


A 


417 


-3 . 


916 


ATOM 


894 


C 


CYS 


A 


417 


-5. 


373 


ATOM 


895 


O 


CYS 


A 


417 


-6. 


220 


ATOM 


896 


N 


VAL 


A 


418 


-4 . 


671 


ATOM 


897 


CA 


VAL 


A 


418 


-4. 


866 


ATOM 


898 


CB 


VAL 


A 


418 


-3 . 


525 



15.341 -7 
15.645 -7 
14.785 -8 
16.764 -8 
13.234 -6 
12.395 -6 
13 . 647 -4 
13 . 099 -4 
13.824 -2 
15. 168 
15. 736 
14. 826 
14 . 855 
15.754 
13 .986 
13 . 108 
12.201 -4 
14.119 -5 
14.215 -6 
15.533 -7 
16.718 -6 
16.594 -5 
17 . 876 -6 
13 . 044 -7 
12.567 -7 
12.577 -8 
11.448 -8 
11.307 -9 
12 . 088-11 
12.. 713-11 
13 .257-10 
12 . 637-12 
10.163 -8. 

9.193 -8. 
10.168 -6, 
8.992 -6. 
8 .720 -5. 
,601 -5. 
.748 -5. 
.613 -5. 
10.996 -5. 
11.819 -4. 
13 .309 -4 . 
13 .924 -3 . 
14.764 -2. 

8.793 -6. 
8 .299 -6 . 
8 . 648 -8 . 
7.890 -9. 
7 . 902-10 . 
9 .493-11. 
6.452 -8. 

5.794 -9. 
5.968 -7. 
4.612 -7. 
3 .841 -7. 



7 
9 
9 



78^3" 
. 144 
. 928 
. 001 
.474 
. 179 
. 762 
. 940 
. 169 
. 825 
. 870 
. 465 
. 577 
.384 
. 024 
. 438 
. 889 
. 706 
. 717 
. 440 
.210 
. 299 
.249 
. 687^ 
.385 
. 546 
. 010 
. 925 
. 741 
. 049 
.375 
. 501 
. 641 
. 141 
.682 
. 860 
. 040 
. 804 
.472 
. 978 
. 779 
. 704 
.510 
777 
898 
809 
886 
714 
015 
148 
305 
168 
752 
359 
731 
232 
206 



1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1, 

1 , 

1, 

1 . 

1 . 

1 . 

1 . 

1 . 

1. 

1. 

1 . 

1. 

1. 

1. 

1. 

1 . 

1 . 

1. 

1. 

1. 



Tffb 
. oo 

. 00 
. 00 
. 00 
. 00 
.00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
. 00 
.00 
. 00 
.00 
.00 
.00 
, 00 
.00 
, 00 
.00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
00 



1.00 



00 
00 
00 
00 
00 
00 
00 
00 



35.40 

40 . 02 

41 .32 

45 . 03 
45.01 
36 .10 
34 .08 
35.99 
39.37 
39.75 
40 . 49 
40.65 
37.48 

42 .84 
40 . 82 

38 .32 
39.80 

39 .92 
41.64 

43 .19 

46 .44 
47.86 

48 . 05 

49 .43 
41.48 

40 .50 

42 . 82 

44 . 73 

47 .52 

50 .47 
53 .91 
53 . 68 
56 . 91 

43 .21 
42 .33 
41.97 
40 .43 
40 .80 
37 . 83 
42.16 

44 .34 
45.65 
47 .42 
50 . 97 
53 .25 
56.95 

45 .33 

46 . 50 
45.18 

46 .25 
49 .29 
49 . 59 

47 . 18 
46 . 50 
45 . 07 
42 . 75 
42 .45 
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BNSOOCID: <WO_ 



_9960014A3_IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



5 


ATOM 


899 


CGI 


VAL 


A 


418 


-3 . 670 


2.563 - 


6 . 410 


1.00 


40 . 22 




ATOM 


900 


CG2 


VAL 


A 


418 


-3 . 071 


3.538 - 


8.634 


1.00 


3 8.03 




ATOM 


901 


C 


VAL 


A 


418 


-5 . 441 


4.714 - 


5.818 


1.00 


41.46 




ATOM 


902 


O 


VAL 


A 


418 


-4 . 883 


5 . 400 - 


4 . 963 


1.00 


42 . 08 




ATOM 


903 


N 


GLU 


A 


419 


-6 . 559 


4.036 - 


5.579 


1.00 


40.95 


10 


ATOM 


904 


CA 


GLU 


A 


419 


-7 . 223 


4.073 - 


4 . 275 


1.00 


42 . 51 




ATOM 


905 


CB 


GLU 


A 


419 


-8 . 536 


3 .282 - 


4 . 333 


1.00 


44 . 52 




ATOM 


906 


CG 


GLU 


A 


419 


-9 .010 


2 . 751 - 


■2 . 984 


1.00 


50 . 42 




ATOM 


907 


CD 


GLU 


A 


419 


-10 .413 


2 . 168 - 


-3 . 035 


1 . 00 


54 . 38 




ATOM 


908 


OE1 


GLU 


A 


419 


-10 . 582 


1. 059 - 


-3 . 590. 


1.00 


54 . 09 


15 


ATOM 


909 


OE2 


GLU 


A 


419 


-11.347 


2 . 820 - 


■2 . 516 


1.00 


57 . 90 




ATOM 


910 


C 


GLU 


A 


419 


-6 .370 


3.552 ■ 


-3 . 121 


1 . 00 


41 . 11 




ATOM 


911 


o 


GLU 


A 


419 


-5 . 955 


2.393 - 


-3 . 116 


1 . 00 


39 . 42 




ATOM 


912 


N 


GLY 


A 


420 


-6 . 129 


4.419 - 


-2 . 140 


1. 00 


40 . 53 




ATOM 


913 


CA 


GLY 


A' 


420 


-5.346 


4.049 ■ 


-0 . 973 


1 . 00 


37 . 61 


20 


ATOM 


914 


C 


GLY 


A 


420 


-3 . 854 


4 . 258 • 


-1 . 140 


1.00 


37.01 




ATOM 


915 


o 


GLY 


A 


420 


-3 . 088 


4.105 ■ 


-0 . 190 


1 . 00 


32 . 59 




ATOM 


916 


N 


MET 


A 


421 


-3 .444 


4.623 • 


-2 . 350 


1. 00 


36.21 




ATOM 


917 


CA 


MET 


A 


421 


-2 . 035 


4.825 


-2 . 656 


1 . 00 


36.02 




ATOM 


918 


CB 


MET 


A 


421 


-1.799 


4.607 


-4 . 160 


1 . 00 


32 . 84 


25 


ATOM 


919 


CG 


MET 


A 


421 


-0 .351 


4 .754 


-4 . 617 


1 . 00 


35 . 82 




ATOM 


920 


SD 


MET 


A 


421 


0 . 806 


3 . 611 


-3 . 812 


1. 00 


35.57 




ATOM 


921 


CE 


MET 


A 


421 


0 .881 


2 .294 


-5 . 005 


1 . 00 


32 . 51 




ATOM 


922 


C 


MET 


A 


421 


-1.474 


6 .180 , 


-2 . 226 


1 . 00 


34 . 93 




ATOM 


923 


O 


MET 


A 


421 


-0.275 


6.294 


-1.985 


1 . 00 


35 . 17 


30 


ATOM 


924 


N 


VAL 


A 


422 


-2 .319 


7 .205 


-2 . 118 


1 . 00 


33 . 97 




ATOM 


925 


CA 


VAL 


A 


422 


-1 . 823 


8 .520 


-1.708 


1 . 00 


31 . 29 




ATOM 


926 


CB 


VAL 


A 


422 


-2 . 927 


9 .607 


-1.766 


1 . 00 


33 . 14 




ATOM 


92 7 


CGI 


VAL 


A 


422 


-3 . 823 


9 .535 


-0 .533 


1 . 00 


30 . 10 




ATOM 


928 


CG2 


VAL 


A 


422 


-2 .279 


10 . 982 


-1.854 


1 . 00 


30 . 08 


35 


ATOM 


929 


C 


VAL 


A 


422 


-1.231 


8 .498 


-0 .296 


1 . 00 


32 . 64 




ATOM 


930 


O 


VAL 


A 


422 


-0 . 274 


9 .220 


0 . 002 


1 . 00 


28 .41 




ATOM 


931 


N 


GLU 


A 


423 


-1. 803 


7 . 670 


0 . 571 


1 . 00 


31.53 




ATOM 


932 


CA 


GLU 


A 


423 


-1 . 311 


7 .558 


1 . 935 


1 . 00 


35 . 99 




ATOM 


933 


CB 


GLU 


A 


423 


-2 .190 


6 .594 


2 . 73 7 


1 . 00 


40 .37 


40 


ATOM 


934 


CG 


GLU 


A 


423 


-3 . 588 


7 .129 


3 . 043 


1 . 00 


49 . 41 




ATOM 


935 


CD 


GLU 


A 


423 


-4.438 


7 .336 


1 . 795 


1 . 00 


52 .38 




ATOM 


936 


OE1 


GLU 


A 


423 


-5 . 349 


8 .188 


1. 835 


1. 00 


56 . 91 




ATOM 


937 


OE2 


GLU 


A 


423 


-4 . 200 


6 . 652 


0 . 776 


1 . 00 


54 . 53 




ATOM 


938 


C 


GLU 


A 


423 


0 . 127 


7 .043 


1.886 


1 . 00 


34 . 83 


45 


ATOM 


939 


O 


GLU 


A 


423 


1 . 007 


7 .552 


2 . 581 


1 . 00 


31 . 85 




ATOM 


940 


N 


ILE 


A 


424 


0 .369 


6 .038 


1. 050 


1 . 00 


30 . 17 




ATOM 


941 


CA 


ILE 


A 


424 


1.711 


5 .488 


0 . 929 


1 . 00 


28 . 99 




. ATOM 


942 


CB 


ILE 


A 


424 


1. 696 


4 .195 


0 . 109 


1 . 00 


30.96 




ATOM 


943 


CG2 


ILE 


A 


424 


3 . 108 


3 .588 


0 . 068 


1.00 


27 .20 


50 


ATOM 


944 


CGI 


ILE 


A 


424 


0 . 671 


3 .230 


0 . 725 


1 . 00 


30 . 77 




ATOM 


945 


CD1 


ILE 


A 


424 


0 . 810 


1 .787 


0 .291 


1 . 00 


34 . 69 




ATOM 


946 


C 


ILE 


A 


424 


2 . 700 


6 .483 


0 . 312 


1. 00 


28 .21 




ATOM 


947 


O 


ILE 


A 


424 


3 . 856 


6 .551 


0 . 735 


1. 00 


28 .48 




ATOM 


948 


N 


PHE 


A 


425 


2 .253 


7 .260 


-0 . 675 


1. 00 


27 . 68 


55 


ATOM 


949 


CA 


PHE 


A 


425 


3 . 119 


8 .253 


-1 . 315 


1. 00 


27.30 




ATOM 


950 


CB 


PHE 


A 


425 


2 .381 


8 . 958 


-2 . 458 


1 . 00 


26 .36 




ATOM 


951 


CG 


PHE 


A 


425 


2.538 


8 .289 


-3 . 798 


1 . 00 


27 . 22 




ATOM 


952 


CD1 


PHE 


A 


425 


2.619 


9 . 050 


-4 . 958 


1. 00 


27 .36 




ATOM 


953 


CD 2 


PHE 


A 


425 


2 . 566 


6.900 


-3 . 905 


1.00 


27 . 89 


60 


ATOM 


954 


CE1 


PHE 


A 


425 


2 .721 


8 .443 


-6 . 207 


1.00 


29 . 63 




ATOM 


955 


CE2 


PHE 


A 


425 


2.668 


6 .282 


-5 . 149 


1. 00 


27 .28 
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BNSDOCfD: <WO 996001 4A3_IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATOM 


956 


CZ 


PHE 


A 


425 


2 . 


745ttUfU>©6 


'^.303 


1 


. 00 


27 


.63 


ATOM 


957 


C 


PHE 


A 


425 


3 . 


591 


9 


.306 


-0 . 


312 


1 


. 00 


25 


.66 


ATOM 


958 


O 


PHE 


A 


425 


4 . 


757 


9 


.713 


-0 . 


328 


1 


. 00 


26 


.33 


ATOM 


959 


N 


ASP 


A 


426 


2 . 


680 


9 


.746 


0 . 


552 


1 


. 00 


27 


. 92 


ATOM 


960 


CA 


ASP 


A 


426 


2 . 


984 


10 


.759 


1. 


570 


1 


. 00 


28 


. 88 


ATOM 


961 


CB 


ASP 


A 


426 


1. 


721 


11 


.102 


2 . 


369 


1 


. 00 


32 


.58 


ATOM 


962 


CG 


ASP 


A 


426 


0 . 


781 


12 


. 034 


1 . 


613 


1 


. 00 


37 


.47 


ATOM 


963 


OD1 


ASP 


A 


426 


-0. 


432 


12 


. 039 


1. 


925 


1 


- 00 


37 


. 72 


ATOM 


964 


OD2 


ASP 


A 


426 


1 . 


253 


12 


.758 


0 . 


710 


1 


. 00 


36 


.35 


ATOM 


965 


C 


ASP 


A 


426 


4 . 


071 


10 


.278 


2 . 


532 


1 


. 00 


26 


. 96 


ATOM 


966 


O 


ASP 


A 


426 


4 . 


974 


11 


.030 


2 . 


900 


1 


. 00 


2.7 


.20 


ATOM 


967 


N 


MET 


A 


427 


3 . 


978 


9 


.022 


2 . 


947 


1 


. 00 


25 


. 76 


ATOM 


968 


CA 


MET 


A 


427 


4 . 


981 


8 


.468 


3 . 


856 


1 


. 00 


25 


. 89 


ATOM 


969 


CB 


MET 


A 


427 


4 . 


567 


7 


. 070 


4 . 


309 


1 


. 00 


21 


. 17 


ATOM 


970 


CG 


MET 


A 


427 


3 . 


385 


7 


. 072 


5 . 


257 


1 


. 00 


24 


.38 


ATOM 


971 


SD 


MET 


A 


427 


3 . 


153 


5 


.489 


6 . 


080 


1 


. 00 


34 


.32 


ATOM 


972 


CE 


MET 


A 


427 


2 . 


173 


4 


. 637 


4 . 


910 


1 


. 00 


21 


. 03 


ATOM 


973 


C 


MET 


A 


427 


6 . 


321 


8 


.410 


3 . 


128 


1 


. 00 


22 


.29 


ATOM 


974 


O 


MET 


A 


427 


7 . 


363 


8 


.760 


3 . 


689 


1 


. 00 


22 


. 19 


ATOM 


975 


N 


LEU 


A 


428 


6 . 


285 


7 


.985 


1 . 


868 


1 


. 00 


21 


. 75 


ATOM 


976 


CA 


LEU 


A 


428 


7 . 


506 


7 


.892 


1 . 


075 


1 


. 00 


22 


. 91 


ATOM 


977 


CB 


LEU 


A 


428 


7 . 


202 


7 


.252 


-0 . 


287 


1 


. 00 


18 


.47 


ATOM 


978 


CG 


LEU 


A 


428 


6 . 


910 


5 


.747 


-0 . 


176 


1 


. 00 


19 


.24 


ATOM 


979 


CD1 


LEU 


A 


428 


6 . 


278 


5 


.222 


-1 . 


468 


1 


.00 


16 


. 82 


ATOM 


980 


CD 2 


LEU 


A 


428 


8 . 


204 


5 


. 010 


0 . 


131 


1 


. 00 


16 


. 23 


ATOM 


981 


C 


LEU 


A 


428 


8 . 


148 


9 


.269 


0 . 


902 


1 


. 00 


23 


. 98 


ATOM 


982 


O 


LEU 


A 


428 


9 . 


366 


9 


.416 


1 . 


034 


1 


. 00 


23 


. 06 


ATOM 


983 


N 


LEU 


A 


429 


7 . 


328 


10 


.281 


0 . 


628 


1 


. 0 0 


23 


. 91 


ATOM 


984 


CA 


LEU 


A 


429 


7 . 


837 


11 


. 642 


0 . 


462 


1 


. 00 


26 


. 29 


ATOM 


985 


CB 


LEU 


A 


429 


6 . 


714 


12 


. 571 


-0 . 


003 


1 


. 00 


27 


.47 


ATOM 


986 


CG 


LEU 


A 


429 


6". 


331 


12 


.411 


-1 . 


476 


1 


. 00 


30 


. 78 


ATOM 


987 


CD1 


LEU 


A 


429 


5 . 


022 


13 


.139 


-1 . 


751 


1 


. 00 


34 


. 75 


ATOM 


988 


CD 2 


LEU 


A 


429 


7 . 


449 


12 


. 952 


-2 . 


350 


1 


. 00 


31 


. 96 


ATOM 


989 


C 


LEU 


A 


429 


8 . 


425 


12 


.166 


1. 


776 


1 


.00 


25 


. 83 


ATOM 


990 


0 


LEU 


A 


429 


9 . 


482 


12 


.808 


1 . 


793 


1 


. 00 


26 


.42 


ATOM 


991 


N 


ALA 


A 


430 


7 . 


734 


11 


.890 


2 . 


877 


1 


. 00 


26 


.45 


ATOM 


992 


CA 


ALA 


A 


430 


8 . 


201 


12 


.333 


4 . 


185 


1 


. 00 


26 


. 11 


ATOM 


993 


CB 


ALA 


A 


430 


7 . 


214 


11 


.909 


5 . 


265 


1 


. 00 


23 


. 13 


ATOM 


994 


C 


ALA 


A 


430 


9 . 


577 


11 


.742 


4 . 


462 


1 


. 00 


25 


. 01 


ATOM 


995 


0 


ALA 


A 


430 


10 . 


455 


12 


.409 


5 . 


005 


1 


. 00 


24 


.31 


ATOM 


996 


N 


THR 


. A 


431 


9. 


767 


10 


.486 


4 . 


074 


1 


.00 


25 


. 25 


ATOM 


997 


CA 


THR 


A 


431 


11 . 


046 


9 


.825 


4 . 


294 


1 


. 00 


22 


.78 


ATOM 


998 


CB 


THR 


A 


431 


10 . 


973 


8 


.323 


3 . 


962 


1 


. 00 


21 


.36 


ATOM 


999 


OG1 


THR 


A 


431 


9 . 


924 


7 


.727 


4 . 


727 


1 


. 00 


20 


.27 


ATOM 


1000 


CG2 


THR 


A 


431 


12 . 


291 


7 


.633 


4 . 


299 


1 


. 00 


19 


. 99 


ATOM 


1001 


C 


* THR 


A 


431 


12 . 


103 


10 


.477 


3 . 


429 


1 


. 00 


23 


. 73 


ATOM 


1002 


0 


THR 


A 


431 


13 . 


234 


10 


.667 


3 . 


868 


1 


. 00 


19 


. 60 


ATOM 


1003 


N 


SER 


A 


432 


11. 


736 


10 


. 819 


2 . 


197 


1 


. 00 


24 


. 32 


ATOM 


1004 


CA 


SER 


A 


432 


12 . 


676 


11 


.479 


1 . 


301 


1 


. 00 


26 


. 96 


ATOM 


1005 


CB 


SER 


A 


432 


12 . 


067 


11 


. 650 


-0 . 


093 


1 


. 00 


28 


. 70 


ATOM 


1006 


OG 


SER 


A 


432 


13 . 


084 


11 


.930 


-1. 


039 


1 


. 00 


33 


.42 


ATOM 


1007 


C 


SER 


A 


432 


13 . 


033 


12 


. 850 


1 . 


876 


1 


.00 


27 


. 92 


ATOM 


1008 


0 


SER 


A 


432 


14. 


176 


13 


.294 


1. 


779 


1 


. 00 


30 


. 78 


ATOM 


1009 
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ATOM 


1410 


CG 


LEU 


A 


486 


,26 


.508 


2 


.921 


8 


.486 


1 


.00 


23 


.50 




ATOM 


1411 


CD1 


LEU 


A 


486 


25 


.210 


2 


.550 


9 


.183 


1 


. 00 


22 


. 03 
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15 



20 
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35 
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45 
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55 



60 



ATOM 


1412 


CD 2 


LEU 


A 


486 


26 


.309 


4 


. 128 


7 


. 577 


1 


. 00 


21 


.35 


ATOM 


1413 


C 


LEU 


A 


486 


29 


. 662 


4 


. 519 


9 


. 960 


1 


. 00 


27 


.36 


ATOM 


1414 


O 


LEU 


A 


486 


29 


.745 


5 


. 710 


9 


. 669 


1 


. 00 


25 


. 87 


ATOM 


1415 


N 


ILE 


A 


487 


30 


. 151 


4 


. 015 


11 


. 088 


1 


. 00 


27 


. 88 


ATOM 


1416 


CA 


ILE 


A 


487 


30 


. 843 


4 


.857 


12 


. 055 


1 


. 00 


28 


. 40 


ATOM 


1417 


CB 


ILE 


A 


487 


31 


.203 


4 


. 054 


13 


. 332 


1 


. 00 


26 


. 74 


ATOM 


1418 


CG2 


ILE 


A 


487 


32 


.255 


4 


.803 


14 


. 154 


1 


. 00 


27 


. 54 


ATOM 


1419 


CGI 


ILE 


A 


487 


29 


. 937 


3 


.813 


14 


. 163 


1 


. 00 


25 


. 93 


ATOM 


1420 


CD1 


ILE 


A 


487 


29 


.237 


5 


. 088 


14 


. 624 


1 


. 00 


23 


. 42 


ATOM 


1421 


C 


ILE 


A 


487 


32 


.125 


5 


.393 


11 


.412 


1 


. 00 


28 


• 89 


ATOM 


1422 


O 


ILE 


A 


487 


32 


.497 


6 


.554 


11 


. 602 


1 


. 00 


29 


. 85 


ATOM 


1423 


N 


HIS 


A 


488 


32 


. 791 


4 


.533 


10 


. 649 


1 


. 00 


29 


. 71 


ATOM 


1424 


CA 


HIS 


A 


488 


34 


. 031 


4 


.898 


9 


. 967 


1 


. 00 


34 


. 12 


ATOM 


1425 


CB 


HIS 


A 


488 


34 


.585 


3 


.691 


9 


. 207 


1 


. 00 


36 


. 61 


ATOM 


1426 


CG 


HIS 


A 


488 


35 


.799 


3 


. 997 


8 


.385 


1 


. 00 


42 


. 74 


ATOM 


1427 


CD 2 


HIS 


A 


488 


35 


. 970 


4 


.089 


7 


. 045 


1 


. 00 


43 


- 12 


ATOM 


1428 


ND1 


HIS 


A 


488 


37 


. 034 


4 


.239 


8 


. 946 


1 


• oo 


43 


. 13 


ATOM 


1429 


CE1 


HIS 


A 


488 


37 


. 913 


4 


.466 


7 


. 987 


1 


. 00 


43 


.40 


ATOM 


1430 


NE2 


HIS 


A 


488 


37 


.293 


4 


.381 


6 


. 825 


1 


. 00 


45 


. 63 


ATOM 


1431 


C 


HIS 


A 


488 


33 


.799 


6 


. 051 


8 


. 998 


1 


• 00 


32 


. 74 


ATOM 


1432 


O 


HIS 


A 


488 


34 


. 577 


7 


. 004 


8 


. 955 


1 


. 00 


31 


. 06 


ATOM 


1433 


N 


LEU 


A 


489 


32 


. 721 


5 


. 958 


8 


. 223 


1 


. 0 0 


33 


.56 


ATOM 


1434 


CA 


LEU 


A 


489 


32 


.384 


6 


. 992 


7 


. 258 


1 


. 00 


30 


. 78 


ATOM 


1435 


CB 


LEU 


A 


489 


31 


. 145 


6 


.587 


6 


. 464 


1 


. 00 


34 


. 67 


ATOM 


1436 


CG 


LEU 


A 


489 


31 


.310 


5 


.353 


5 


. 574 


1 


. 00 


34 


. 73 


ATOM 


1437 


CD1 


LEU 


A 


489 


29 


. 945 


4 


.856 


5 


. 125 


1 


. 00 


33 


. 2 1 


ATOM 


1438 


CD 2 


LEU 


A 


489 


32 


. 183 


5 


. 701 


4 


. 378 


1 


. 00 


35 


. 92 


ATOM 


1439 


C 


LEU 


A 


489 


32 


.124 


8 


.320 


7 


. 954 


1 


. 00 


33 


. 97 


ATOM 


1440 


O 


LEU 


A 


489 


32 


.587 


9 


.365 


7 


.507 


1 


. 00 


33 


. 22 


ATOM 


1441 


N 


MET 


A 


490 


31 


.387 


8 


.274 


9 


. 058 


1 


. 00 


31 


. 33 


ATOM 


1442 


CA 


MET 


A 


490' 


31 


. 056 


9 


.482 


9 


. 801 


1 


. 00 


30 


. 61 


ATOM 


1443 


CB 


MET 


A 


490 


30 


. 000 


9 


.161 


10 


. 862 


1 


. 00 


32 


. 34 


ATOM 


1444 


CG 


MET 


A 


490 


28 


.607 


8 


.940 


10 , 


. 289 


1 


. 00 


30 


. 71 


ATOM 


1445 


SD 


MET 


A 


490 


27 


.457 


8 


.247 


11. 


.496 


1 


. 00 


31 


. 14 


ATOM 


1446 


CE 


MET 


A 


490 


26 


.321 


7 


.408 


10 , 


. 418 


1 


. 00 


30 


. 36 


ATOM 


1447 


C 


MET 


A 


490 


32 


.287 


10 


.108 


10 , 


.455 


1 


. 00 


32 


. 22 


ATOM 


1448 


O 


MET 


A 


490 


32, 


.412 


11 


.330 


10 . 


. 517 


1 


. 00 


28 . 


. 25 


ATOM 


1449 


N 


ALA 


A 


491 


33 , 


. 184 


9 


.262 


10 . 


. 949 


1 , 


. 00 


33 , 


. 81 


ATOM 


1450 


CA 


ALA 


A 


491 


34. 


.407 


9 


.730 


11 . 


.585 


1 , 


.00 


39 


. 92 


ATOM 


1451 


CB 


ALA 


A 


491 


35, 


.16 8 


8 


.554 


12 . 


. 185 


1 , 


. 00 


37 , 


. 22 


ATOM 


1452 


C 


ALA 


A 


491 


35, 


.275 


10 


.445 


10 . 


.550 


1 . 


. 00 


42 , 


. 68 


ATOM 


1453 


O 


ALA 


A 


491 


35, 


.865 


11 


.487 


10, 


. 838 


1 . 


. 00 


45 , 


. 32 


ATOM 


1454 


N 


LYS 


A 


492 


35. 


,339 


9 . 


.-876 


9 . 


347 


1 . 


. 00 


44 , 


.39 


ATOM 


1455 


CA 


LYS 


A 


492 


36 . 


. 122 


10 , 


.440 


8 . 


248 


1 . 


.00 


44 . 


. 80 


ATOM 


1456 


CB 


LYS 


A 


492 


36. 


.136 


9 , 


.477 


7 . 


052 


1 . 


. 00 


46 . 


. 96 


ATOM 


1457 


CG 


LYS 


A 


492 


37 . 


490 


8 . 


.840 


6 . 


744' 


1 . 


. 00 


47 . 


.20 


ATOM 


1458 


CD 


LYS 


A 


492 


37. 


390 


7 , 


.830 


5 . 


595 


1 , 


.00 


45 . 


. 71 


ATOM 


1459 


CE 


LYS 


A 


492 


38. 


631 


6. 


.937 


5 . 


518 


1 . 


00 


45 . 


.55 


ATOM 


1460 


NZ 


LYS 


A 


492 


38. 


357 


5 , 


.577 


4 . 


948 


1 . 


00 


36 . 


28 


ATOM 


1461 


C 


LYS 


A 


492 


35. 


534 


11. 


.780 


7 . 


809 


1 . 


00 


45 . 


61 


ATOM 


1462 


O 


LYS 


A 


492 


36. 


227 


12 . 


.604 


7 . 


215 


1 . 


00 


46. 


18 


ATOM 


1463 


N 


ALA 


A 


493 


34. 


254 


11. 


.992 


8 . 


100 


1 . 


00 


43 . 


75 


ATOM 


1464 


CA 


ALA 


A 


493 


33 . 


590 


13 . 


238 


7. 


728 


1 . 


00 


42 . 


42 


ATOM 


1465 


CB 


ALA 


A 


493 


32 . 


097 


13 . 


001 


7 . 


528 


1 . 


00 


40 . 


92 


ATOM 


1466 


C 


ALA 


A 


493 


33 . 


816 


14 . 


305 


8 . 


796 


1 . 


00 


41 . 


78 


ATOM 


1467 


O 


ALA 


A 


493 


33 . 


277 


15 . 


410 


8 . 


707 


1 . 


00 


40. 


76 


ATOM 


1468 


N 


GLY 


A 


494 


34. 


604 


13 . 


960 


9. 


811 


1 . 


00 


41 . 


01 
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5 


ATOM 


1469 


CA 


GLY 


A 


494 


34 .903 


14 . 904 10 . 873 


1.00 




ATOM 


1470 


C 


GLY 


A 


494 


33 .857 


15 . 060 11 . 965 


1 . 00 




ATOM 


1471 


O 


GLY 


A 


494 


33 . 916 


16 . 011 12 . 747 


1 . 00 




ATOM 


1472 


N 


LEU 


A 


495 


32 . 905 


14.138 12.043 


1 . 00 




ATOM 


1473 


CA 


LEU 


A 


495 


31.876 


14 . 248 13 . 068 


1 . 00 


10 


ATOM 


1474 


CB 


LEU 


A 


495 


30 . 713 


13 . 304 12 . 769 


1 . 00 




ATOM 


1475 


CG 


LEU 


A 


495 


29 ,540 


13 .901 11. 988 


1 . 00 




ATOM 


1476 


CD1 


LEU 


A 


495 


29.976 


14.170 10.553 


1 . 00 




ATOM 


1477 


CD 2 


LEU 


A 


495 


28.349 


12.943 12.026 


1 . 00 




ATOM 


1478 


C 


LEU 


A 


495 


32 .461 


13.923 14.431 


1. 00 


15 


ATOM 


1479 


O 


LEU 


A 


495 


33 .347 


13 . 074 14 . 544 


1 . 00 




ATOM 


1480 


N 


THR 


A 


496 


31.979 


14.604 15.459 


1 . 00 




ATOM 


1481 


CA 


THR 


A 


496 


32 .462 


14.350 16. 812 


1 . 00 




ATOM 


1482 


CB 


THR 


A 


496 


31 . 925 


15 .375 17 . 829 


1 . 00 




ATOM 


1483 


OG1 


THR 


A 


496 


30 . 498 


15 . 263 17 . 908 


1 . 00 


20 


ATOM 


1484 


CG2 


THR 


A 


496 


32 . 315 


16.797 17.434 


1. 00 




ATOM 


1485 


C 


THR 


A 


496 


31 . 933 


12 . 987 17 . 210 


1. 00 




ATOM 


1486 


O 


THR 


A 


496 


31. 081 


12.427 16.521 


1 . 00 




ATOM 


1487 


N 


LEU 


A 


497 


32 .429 


12 .452 18 . 319 


1.00 




ATOM 


1488 


CA 


LEU 


. A 


497 


31 . 965 


11.151 18.786 


1 . 00 


25 


ATOM 


1489 


CB 


LEU 


A 


497 


32 . 689 


10 .760 20 . 074 


1 . 00 




ATOM 


1490 


CG 


LEU 


A 


497 


33 . 714 


9 . 640 19 . 896 


1 . 00 




ATOM 


1491 


CD1 


LEU 


A 


497 


34 . 755 


9.692 21. 008 


1 . 00 




ATOM 


1492 


CD 2 


LEU 


A 


497 


32 . 988 


8.305 19 . 884 


1 . 00 




ATOM 


1493 


C 


LEU 


A 


497 


30 .455 


11 . 198 19 . 026 


1 .00 


30 


ATOM 


1494 


O 


LEU 


A 


497 


29 . 712 


10.350 18.534 


1 . 00 




ATOM 


1495 


N 


GLN 


A 


498 


30 . 006 


12.202 19.773 


1 . 00 




ATOM 


1496 


CA 


GLN 


A 


498 


28 . 586 


12.348 20.062 


1 . 00 




ATOM 


1497 


CB 


GLN 


A 


498 


28 .344 


13.566 20.951 


1 .00 




ATOM 


1498 


CG 


GLN 


A 


498 


26 . 894 


13 .796 21.341 


1 . 00 


35 


ATOM 


1499 


CD 


GLN 


A 


498 


26.712 


15 . 130 22 . 015 


1 . 00 




ATOM 


1500 


OE1 


GLN 


A 


498 


27.363 


16 . 112 21 . 686 


1 . 00 




ATOM 


1501 


NE2 


GLN 


A 


4 98 


25 . 809 


15.176 23 . 008 


1 . 00 




ATOM 


1502 


C 


GLN 


A 


498 


27 . 776 


12 .476 18 .773 


1 . 00 




ATOM 


1503 


O 


GLN 


A 


498 


26 . 682 


11 . 927 18 . 665 


1 . 00 


40 


ATOM 


1504 


N 


GLN 


A 


499 


28 .311 


13 . 196 17 . 793 


1 . 00 




ATOM 


1505 


CA 


GLN 


A 


499 


27 . 603 


13 .362 16 . 524 


1 . 00 




ATOM 


1506 


CB 


GLN 


A 


499 


28 .292 


14.420 15.661 


1 . 00 




ATOM 


1507 


CG 


GLN 


A 


499 


28 . 135 


15 . 840 16 . 191 


1.00 




ATOM 


1508 


CD 


GLN 


A 


499 


28 . 930 


16.849 15.389 


1 . 00 


45 


ATOM 


1509 


OE1 


GLN 


A 


499 


29 . 956 


16.518 14.795 


1 . 00 




ATOM 


1510 


NE2 


GLN 


A 


499 


28 .457 


18.089 15.364 


1. 00 




ATOM 


1511 


C 


GLN 


A 


499 


27 . 529 


12.047 15.753 


1 . 00 




ATOM 


1512 


O 


GLN 


A 


499 


26 . 567 


11.793 15.032 


1 . 00 




ATOM 


1513 


N 


GLN 


A 


500 


28 - 550 


11.214 15.903 


1 . 00 


50 


ATOM 


1514 


CA 


GLN 


A 


500 


28 .577 


9.937 15.216 


1 . 00 




ATOM 


1515 


CB 


GLN 


A 


500 


29 . 933 


9.276 15.406 


1 . 00 




ATOM 


1516 


CG 


GLN 


A 


500 


31 . 012 


9.839 14.508 


1 . 00 




ATOM 


1517 


CD 


GLN 


A 


500 


32 . 371 


9 .370 14 . 930 


1 . 00 




ATOM 


1518 


OE1 


GLN 


A 


500 


32 . 612 


8 . 194 15 . 141 


1 . 00 


55 


ATOM 


1519 


NE2 


GLN 


A 


500 


33 .301 


10.324 15.082 


1.00 




ATOM 


1520 


C 


GLN 


A 


500 


27 . 459 


9.017 15.711 


1 . 00 




ATOM 


1521 


O 


GLN 


A 


500 


26 . 700 


8 .469 14 . 908 


1 . 00 




ATOM 


1522 


N 


HIS 


A 


501 


27.357 


8 . 864 17 . 029 


1.00 




ATOM 


1523 


CA 


HIS 


A 


501 


26 .327 


8.021 17.631 


1 . 00 


60 


ATOM 


1524 


CB 


HIS 


A 


501 


26 . 535 


7.919 19.145 


1.00 




ATOM 


1525 


CG 


HIS 


A 


501 


27 . 892 


7.420 19.535 


1. 00 



41.63 
41.18 
38 .22 
39.53 

38 . 91 

39 .20 

40 .73 
37.80 
40 . 94 

36 .01 

34 . 85 

37 .52 

35 .45 
37 . 55 

32 . 93 

36 . 16 
35.67 
34 .34 

34 . 88 

35 . 67 
41.10 
45 .27 
45.09 
47 . 77 

33 . 72 

33 .20 
30 . 82 
31.47 
30 . 51 

34 .38 
38.60 
42 . 92 
40 . 02 
30 .47 
30 . 85 

29 .52 

30 .24 
30.20 
31 . 60 
31 . 61 
30 .66 
34 .17 

29 .40 

30 . 04 
25.67 
29.30 
31. 52 
33 . 05 
34.84 
36 .47 
38 . 25 
27 . 98 
24 . 84 
26 .20 
27.63 
27.97 
34.27 
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60 



ATOM 


1526 


CD2 


HIS 


A 


501 


28 . 


. 726 


6 


. 540 


18 . 


.931 


1. 


. 00 


36 , 


. 10 


ATOM 


1527 


ND1 


HIS 


A 


501 


28 . 


. 541 


7 


. 844 


20 . 


.676 


1, 


.00 


31, 


. 81 


ATOM 


1528 


CE1 


HIS 


A 


501 


29. 


,716 


7 


. 244 


20 . 


.758 


1, 


. 00 


34 


. 89 


ATOM 


1529 


NE2 


HIS 


A 


501 


29, 


.854 


6 


. 448 


19 . 


.712 


1, 


. 00 


37 


. 46 


ATOM 


1530 


C 


HIS 


A 


501 


24 , 


.935 


8 


.572 


17 


.348 


1, 


. 00 


24 


. 93 


ATOM 


1531 


O 


HIS 


A 


501 


23 < 


. 998 


7 


. 815 


17 


. 107 


1 


. 00 


26 


. 73 


ATOM 


1532 


N 


GLN 


A 


502 


24 


.796 


9 


. 892 


17 


.379 


1 


. 00 


22 


. 79 


ATOM 


1533 


CA 


GLN 


A 


502 


23 


.504 


10 


.498 


17 


.119 


1 


.00 


26 


. 14 


ATOM 


1534 


CB 


GLN 


A 


502 


23 


.554 


12 


. 006 


17 


.371 


1 


. 00 


22 


.36 


ATOM 


1535 


CG 


GLN 


A 


502 


23 


.460 


12 


.378 


18 


. 848 


1 


.00 


26 


. 19 


ATOM 


1536 


CD 


GLN 


A 


502 


23 


.589 


13 


.875 


19 


.089 


1 


. 00 


28 


.67 


ATOM 


1537 


OE1 


GLN 


A 


502 


23 


.632 


14 


. 663 


18 


.149 


1 


. 00 


28 


.40 


ATOM 


1538 


NE2 


GLN 


A 


502 


23 


. 651 


14 


.268 


20 


.355 


1 


. 00 


24 


. 72 


ATOM 


1539 


C 


GLN 


A 


502 


23 


. 056 


10 


. 221 


15 


. 685 


1 


.00 


26 


. 19 


ATOM 


1540 


O 


GLN . 


A 


502 


21 


. 913 


9 


.822 


15 


.453 


1 


. 00 


24 


.09 


ATOM 


1541 


N 


ARG 


A 


503 


23 


. 955 


10 


. 429 


14 


.727 


1 


. 00 


24 


.88 


ATOM 


1542. 


CA 


ARG 


A 


503 


23 


. 630 


10 


. 196 


13 


.326 


1 


. 00 


25 


.25 


ATOM 


1543 


CB 


ARG 


A 


503 


24 


.772 


10 


. 668 


12 


.418 


1 


. 00 


27 


. 63 


ATOM 


1544 


CG 


ARG 


A 


503 


24 


.432 


10 


. 563 


10 


. 932 


1 


. 00 


28 


. 75 


ATOM 


1545 


CD 


ARG 


A 


503 


25 


.479 


11 


.222 


10 


.056 


1 


. 00 


27 


. 72 


ATOM 


1546 


NE 


ARG 


A 


503 


25 


. 072 


11 


.214 


8 


.654 


1 


. 00 


29 


.35 


ATOM 


1547 


CZ 


ARG 


A 


503 


24 


.279 


12 


. 126 


8 


. 105 


1 


. 00 


25 


. 84 


ATOM 


1548 


NH1 


ARG 


A 


503 


23 


. 804 


13 


. 120 


8 


. 840 


1 


. 00 


27 


.35 


ATOM 


1549 


NH2 


ARG 


A 


503 


23 


. 962 


12 


. 044 


6 


. 820 


1 


. 00 


30 


. 63 


ATOM 


1550 


C 


ARG 


A 


503 


23 


.347 


8 


. 716 


13 


.065 


1 


. 00 


24 


.53 


ATOM 


1551 


O 


ARG 


A 


503 


22 


.425 


8 


.375 


12 


.321 


1 


. 00 


25 


.90 


ATOM 


1552 


N 


LEU 


A 


504 


24 


. 143 


7 


. 841 


13 


.672 


1 


.00 


23 


.00 


ATOM 


1553 


CA 


LEU 


A 


504 


23 


. 953 


6 


. 406 


13 


.496 


1 


. 00 


22 


.60 
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B 
B 
B 
B 
B 
B 
B 
B 
B 



600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

600 

601 

305 

305 

305 

305 

305 

306 

306 

306 

306 

306 

306 

306 

306 

307 

307 

307 

307 

307 

308 

308 

308 

308 

308 

308 

308 

308 

309 

309 

309 

309 

309 

309 

310 

310 

310 

310 

310 

310 

310 

310 



2.987 
1.597 
0 . 842 
-0 . 506 

I. 421 
2 .793 
5 . 671 
7 . 113 

7 . 541 

8 . 889 

9 . 814 
11.125 

9 .423 
8.066 
4 . 894 
4 . 959 

14 .781 
12 .321 
12 .672 
13 .701 
12 . 045 

II . 875 
12 . 193 
12 . 884 
11 . 884 

12 .221 
13 .304 

10 . 965 

13 . 660 
14 . 570 
13 .293 
13 . 971 
13 . 092 
15.303 
16. 196 
15.431 
16 . 643 
16 . 413 
16.315 
15.942 
15.287 
17 . 874 
19 .000 
17 . 669 
18 .796 
18.562 
17 .459 
19 . 072 
20 . 053 
18 .217 

18 .394 
17 .205 
16 .216 
16 . 040 
14 . 881 

19 .691 
20 . Ill 



0 . 
0 . 
0 , 
0 . 

1 . 

0 . 

1 , 

2 . 
-3 



■0 - 978 
•1.150 
■1.214 
-1 .419 
-1.099 
-0.929 
0 .461 
0 .561 
306 
,429 
.804 
.901 
. 066 
.937 
765 
468 
035- 
21 . 086 
22 . 102 
22 . 760 
23 . 521 
22 . 187 
21.293 
21. 133 
21.200 
20 .417 
21 . 144 
20 . 258 
19 - 819 
19 . 654 
18 . 881 
17 .589 
16 .584 
17 .719 
16 . 885 
18.769 
18 . 983 
20 . 100 
19.708 
18.239 
20 . 602 
19.297 
19 .102 
19 .775 
20 . 100 
21.447 
21.379 
19 . 028 
19 . 119 
18 . 012 
16.936 
15.969 
15.873 
17.219 
15.380 
16.174 
16 . 070 



-3 .784 
-3.684 



-4 . 871 
-4.824 
-6 .143 
-6 .230 
-5 .482 
-5 .809 
-7 . 131 
-7 .477 
-6.488 
-6 . 839 
-5 . 161 
-4 . 838 
-5 . 443 
-4 . 070 
-17 .739 
25 .295 
27.548 
27 . 702 

25 . 606 

26 . 251 

28 .484 

29 .757 

30 .913 
32.183 
32 . 966 
33 . 027 
29 . 803 
30 . 614 
28 . 933 
28 . 861 
28 . 143 
28 . 122 
28 . 274 
27 .320 
26 .542 
25 . 526 
24 . 051 
23 . 903 
23.375. 
27 . 385 
26 . 932 
28 . 608 
29.475 
30 .163 
31 . 046 
30.529 
31.269 
30 . 596 
31 .569 
31 .499 

32 .668 

33 .355 
32 . 138 
31.285 
30 . 139 



1. 
1 . 
1 . 
1. 
1. 
1. 
1 . 
1. 
1 , 
1 , 
1 
1 
1 
1 
1 
1 
1 
1 
1 



00 
00 
00 
00 
00 
00 
.00 
,00 
.00 
. 00 
. 00 
. 00 
.00 
. 00 
.00 
.00 
.00 
.00 
. 00 



1.00 



00 
00 
00 
00 
,00 
.00 
.00 
.00 
. 00 
. 00 
.00 
. 00 
.00 
.00 
.00 



1.00 
1.00 
1.00 
1.00 



1. 

1. 

1 . 

1. 

1, 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 

1 



00 
00 
00 
00 
00 
00 
.00 
, 00 
. 00 
. 00 
. 00 
. 00 
.00 
.00 
.00 
.00 
. 00 
.00 



23 . 92 
29 .77 
31.40 
33.36 
27 .01 
27 .40 

22 .39 
21. 75 
19 . 97 

23 . 81 
21.88 
22 .32 
19.74 
21.25 
21.47 
21 .38 

24 .10 
64 .27 
64 .37 
66 .90 

63 . 72 

64 .21 

63 . 09 
60 .98 
61.23 
62 . 23 
62 . 56 

64 .31 
58 .39 
58.56 
54.82 
50 . 62 
51.30 
46 . 84 
45 . 62 
43 .46 
43 . 01 
41.32 

43 .10 
40.51 
39.80 
42 . 11 

44 .34 
40 . 88 
42 .79 
41.25 
46.67 
42 . 60 
44 . 18 
39.44 

37 . 62 

38 . 84 
42 .43 
42 . 55 

39 . 69 
34. 11 
34.41 
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SUBSTITUTE SHEET (RULE 26) 



BNSDOCID: <WO_ 



_ 996001 4A3_IA> 



WO 99/060014 



PCTVUS99/06899 



5 


ATOM 


1982 


N 


THR 


B 


311 


20 . 339 


15 . 662 32 . 


326 


1 . 


00 


34 . 


04 




ATOM 


1983 


CA 


THR 


B 


311 


21.564 


14 . 888 32 . 


127 


1 . 


00* 


32 . 


34 




ATOM 


1984 


CB 


THR 


B 


311 


22.434 


14 . 824 33 . 


399 


1. 


00 


31. 


75 




ATOM 


1985 


OG1 


THR 


B 


311 


21.724 


14.116 34 . 


420 


1. 


00 


36 . 


20 




ATOM 


1986 


CG2 


THR 


B 


311 


22.782 


16 .212 33 . 


893 


1 . 


00 


31. 


05 


10 


ATOM 


1987 


C 


THR 


B 


311 


21.145 


13'- 460 31. 


790 


1 . 


00 


32 . 


37 




ATOM 


1988 


O 


THR 


B 


311 


19 . 967 


13 . 117 31 . 


899 


1 . 


00 


28 . 


16 




ATOM 


1989 


N 


ALA 


B 


312 


22.106 


12.628 31. 


396 


1. 


00 


33. 


23 




ATOM 


1990 


CA 


ALrA 


B 


312 


21. 811 


11.237 31. 


053 


1 . 


00 


35. 


63 




ATOM 


1991 


CB 


ALA 


B 


312 


23 .077 


10.527 30. 


577 


1. 


00 


34 . 


00 


15 


ATOM 


1992 


C 


ALA 


B 


312 


21.210 


10 .489 32 . 


240 


1 . 


00 


34 . 


29 




ATOM 


1993 


O 


ALA 


B 


312 


20.226 


9.766 32. 


089 


1 . 


00 


33 . 


10 




ATOM 


1994 


N 


ASP 


B 


313 


21 .800 


10.665 33. 


419 


1 . 


00 


33 . 


90 




ATOM 


1995 


CA 


ASP 


B 


313 


21.304 


9.994 34. 


615 


1 . 


00 


34 . 


19 




ATOM 


1996 


CB 


ASP 


B 


313 


22 .258 


10.219 35. 


788 


1 . 


00 


42 . 


09 


20 


ATOM 


1997 


CG 


ASP 


B 


313 


23 .494 


9.358 35. 


700 


1 . 


00 


44 . 


87 




ATOM 


1998 


OD1 


ASP 


B 


313 


24.586 


9.858 36. 


040 


1 . 


00 


51. 


57 




ATOM 


1999 


OD2 


ASP 


B 


313 


23 .377 


8.184 35. 


290 


1 . 


00 


46 . 


.79 




ATOM 


2000 


C 


ASP 


B 


313 


19 . 925 


10 .520 34 . 


. 971 


1 . 


00 


31 . 


.99 




ATOM 


2001 


O 


ASP 


B 


313 


19 . 056 


9.768 35. 


, 426 


1 . 


0 0 


32 . 


. 03 


25 


ATOM 


2002 


N 


GLN 


B 


314 


19 . 733 


11.819 34. 


, 763 


1 . 


, 00 


29 , 


,38 




ATOM 


2003 


CA 


GLN 


B 


314 


18 .458 


12.457 35. 


, 046 


1 . 


.00 


29 . 


. 73 




ATOM 


2004 


CB 


GLN 


B 


314 


18 . 562 


13 .966 34 . 


. 832 


1 . 


,00 


32 , 


. 88 




ATOM 


2005 


CG 


GLN 


B 


314 


18 . 970 


14.732 36, 


. 085 


1 - 


.00 


36 , 


.47 




ATOM 


2006 


CD 


GLN 


B 


314 


19 .213 


16.208 35, 


. 815 


1 , 


.00 


36 


.76 


30 


ATOM 


2007 


OE1 


GLN 


B 


314 


19.300 


16 .634 34, 


. 664 


1 . 


. 00 


38 


. 79 




ATOM 


2008 


NE2 


GLN 


B 


314 


19 . 327 


16.995 36, 


. 880 


1, 


.00 


39 


. 72 




ATOM 


2009 


C 


GLN 


B 


314 


17 .409 


11.873 34 


. 116 


1 


.00 


29 


. 11 




ATOM 


2010 


O 


GLN 


B 


314 


16 . 274 


11.620 34 


. 522 


1 


.00 


28 


. 82 




ATOM 


2011 


N 


MET 


B 


315 


17 . 801 


11.657 32 


. 864 


1 


.00 


27 


. 27 


35 


ATOM 


2012 


CA 


MET 


B 


315 * 


16 . 900 


11. 079 31 


. 872 


1 


.00 


30 


. 41 




ATOM 


2013 


CB 


MET 


B 


315 


17 .595 


11.029 30 


. 509 


1 


. 00 


30 


. 10 




ATOM 


2014 


CG 


MET 


B 


315 


16 . 787 


10 .345 29 


. 421 


1 


.00 


38 


. 02 




ATOM 


2015 


SD 


MET 


B 


315 


15 .252 


11.220 29 


. 065 


1 


.00 


41 


. 12 




ATOM 


2016 


CE 


MET 


B 


315 


15.890 


12 . 835 28 


. 611 


1 


.00 


39 


.32 


40 


ATOM 


2017 


C 


MET 


B 


315 


16 .490 


9.665 32 


.311 


1 


.00 


27 


.99 




ATOM 


2018 


O 


MET 


B 


315 


15 .302 


9.351 32 


.396 


1 


.00 


26 


.60 




ATOM 


2019 


N 


VAL 


B 


316 


17 .481 


8 .823 32 


. 598 


1 


.00 


27 


.26 




ATOM 


2020 


CA 


VAL 


B 


316 


17 .229 


7 .447 33 


. 027 


1 


.00 


24 


.54 




ATOM 


2021 


CB 


VAL 


B 


316 


18 .554 


6.708 33 


. 351 


1 


. 00 


26 


. 22 


45 


ATOM 


2022 


CGI 


VAL 


B 


316 


18 .272 


5.404 34 


. 096 


1 


. 00 


29 


.81 




ATOM 


2023 


CG2 


VAL 


B 


316 


19.302 


6.410 32 


. 074 


1 


. 00 


29 


. 75 




ATOM 


2024 


C 


VAL 


B 


316 


16.326 


7.389 34 


.258 


1 


. 00 


27 


. 22 




ATOM 


2025 


O 


VAL 


B 


316 


15.397 


6.579 34 


.318 


1 


.00 


25 


. 55 




ATOM 


2026 


N 


SER 


B 


317 


16.601 


8.243 35 


.242 


1 


.00 


24 


.40 


50 


ATOM 


2027 


CA 


SER 


B 


317 


15.799 


8.268 36 


.460 


1 


. 00 


27 


. 63 




ATOM 


2028 


CB 


SER 


B 


317 


16.358 


9.294 37 


.451 


1 


.00 


31 


. 68 




ATOM 


2029 


OG 


SER 


B 


317 


17 .492 


8.771 38 


. 112 


1 


. 00 


39 


.97 




ATOM 


2030 


C 


SER 


B 


317 


14 .346 


8.600 36 


. 154 


1 


. 00 


26 


. 73 




ATOM 


2031 


O 


SER 


B 


317 


13 .434 


7.932 36 


. 648 


1 


.00 


25 


. 65 


55 


ATOM 


2032 


N 


ALA 


B 


318 


14.135 


9.634 35 


.342 


1 


. 00 


24 


. 19 




ATOM 


2033 


CA 


ALA 


B 


318 


12 .786 


10.049 34 


. 969 


1 


.00 


24 


. 17 




ATOM 


2034 


CB 


ALA 


B 


318 


12 . 850 


11.250 34 


. 022 


1 


.00 


21 


.44 




ATOM 


2035 


C 


ALA 


B 


318 


12. 038 


8.890 34 


.306 


1 


. 00 


21 


. 63 




ATOM 


2036 


O 


ALA 


B 


318 


10 . 902 


8.598 34 


. 648 


1 


. 00 


20 


.25 


60 


ATOM 


2037 


N 


LEU 


B 


319 


12.695 


8.225 33 


.364 


1 


.00 


23 


.37 




ATOM 


2038 


CA 


LEU 


B 


319 


12.098 


7.102 32 


. 652 


1 


.00 


25 


.42 
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BNSOOCID: <WO_ 



_9960014A3JA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



5 


ATOM 


2039 


CB 


LEU 


B 


319 


13 . 


05 0 


6 . 


635 


31 , 


. 548 


1 , 


.00 


22 , 


.03 




ATOM 


2040 


CG 


LEU 


B 


319 


13 . 


264 


7 , 


. 622 


30 . 


. 3 94 


1 , 


. 00 


20 


. 71 




TV m n * 

ATOM 


2041 


CD1 


LEU 


B 


319 


14 . 


146 


6 . 


. 995 


29 . 


.331 


1 , 


.00 


23 . 


. 60 




ATOM 


2042 


CD2 


LEU 


B 


319 


11 . 


918 


8 , 


. 020 


29 , 


. 803 


1 , 


. 00 


23 


. 82 




ATOM 


2043 


C 


LEU 


B 


319 


11 . 


729 


5 , 


. 926 . 


33 . 


. 564 


1 , 


. 00 


27 


. 26 


1 A 
1U 


ATOM 


2044 


O 


LEU 


B 


319 


10 . 


615 


5 , 


. 396 


33 


. 488 


1 


.00 


28 


. 91 




ATOM 


2045 


N 


LEU 


B 


320 


12 . 


, 656 


5 . 


. 516 


34 


. 426 


1 


.00 


26 


. 58 




ATOM 


2046 


CA 


LEU 


B 


320 


12 . 


.399 


4 . 


. 405 


35 


. 334 


1 


.00 


26 


. 73 




ATOM 


2 047 


CB 


LEU 


B 


320 


13 . 


. 657 


4 , 


. 075 


36 


. 145 


1 


.00 


26 


. 87 




ATOM 


2048 


CG 


LEU 


B 


320 


14 . 


. 846 


3 


. 460 


35 


. 398 


1 


.00 


26 


. 15 




ATOM 


2 049 


CD1 


LEU 


B 


320 


16 , 


. 053 


3 


. 375 


36 


. 330 


1 


.00 


28 


. 04 




ATOM 


2050 


CD2 


LEU 


B 


320 


14 , 


. 484 


2 


. 076 


34 


. 895 


1 


. 00 


26 


. 96 




tv m /*\ y<i it 

ATOM 


2051 


C 


LEU 


B 


320 


11 , 


> 249 


4 


. 722 


36 


. 290 


1 


.00 


29 


.19 




ATOM 


2052 


O 


LEU 


B 


320 


10 . 


. 449 


3 


. 849 


36 


. 631 


1 


.00 


26 


. 66 




ATOM 


2053 


N 


ASP 


B 


321 


11 . 


. 160 


5 


. 976 


36 


. 719 


1 


.00 


29 


. 72 


OA 
ZU 


ATOM 


2054 


CA 


ASP 


B 


321 


10 


. 112 


6 


. 371 


37 


. 647 


1 


. 00 


31 


.36 




ATOM 


2055 


CB 


ASP 


B 


321 


10 , 


. 494 


7 


. 683 


38 


. 336 


1 


.00 


36 


. 60 




ATOM 


2056 


CG 


ASP 


B 


. 321 


11 


. 407 . 


7 


. 461 


39 


. 535 


1 


.00 


46 


. 11 




ATOM 


2057 


OD1 


ASP 


B 


321 


10 


. 897 


7 


. 058 


40 


. 605 


1 


. 00 


46 


. 64 




ATOM 


2058 


OD2 


ASP 


B 


321 


12 


. 635 


7 


. 676 


39 


.402 


1 


.00 


45 


. 98 


25 


ATOM 


2059 


C 


ASP 


B 


321 


8 


. 742 


6 


.494 


3 6 


. 989 


1 


.00 


28 


.29 




ATOM 


2060 


O 


ASP 


B 


321 


7 


. 715 


6 


.432 


37 


. 661 


1 


.00 


27 


. 19 




ATOM 


2061 


N 


ALA 


B 


322 


8 


. 726 


6 


. 650 


35 


. 672 


1 


.00 


28 


.34 




ATOM 


2062 


CA 


ALA 


B 


322 


7 


. 469 


6 


.779 


34 


. 950 


1 


.00 


25 


.55 




ATOM 


2063 


CB 


ALA 


B 


322 


7 


. 668 


7 


.668 


33 


. 728 


1 


.00 


24 


. 11 


^0 


ATOM 


2064 


C 


ALA 


B 


322 


6 


. 911 


5 


.420 


34 


. 523 


1 


.00 


22 


. 80 




ATOM 


2065 


o 


ALA 


B 


322 


5 


. 810 


5 


.338 


33 


. 979 


1 


.00 


24 


. 54 




ATOM 


2066 


N 


GLU 


B 


323 


7 


. 662 


4 


.355 


34 


. 781 


1 


.00 


20 


. 16 




ATOM 


2067 


CA 


GLU 


B 


323 


7 


. 229 


3 


. 021 


34 


.386 


1 


.00 


21 


.44 




ATOM 


2068 


CB 


GLU 


B 


323 


8 


. 196 


1 


. 982 


34 


- 938 


1 


.00 


23 


. 72 


35 


ATOM 


2069 


CG 


GLU 


B 


323 


9 


.393 


1 


. 746 


34 


. 024 


1 


.00 


23 


. 58 




ATOM 


2070 


CD 


GLU 


B 


323 


8 


.988 


1 


.134 


32 


. 685 


1 


.00 


25 


. 23 




ATOM 


2071 


OE1 


GLU 


B 


323 


8 


. 852 


1 


.881 


31 


. 692 


1 


. 00 


21 


. 74 




ATOM 


2072 


OE2 


GLU 


B 


323 


8 


• 809 


-0 


. 095 


32 


. 624 


1 


.00 


25 


.49 




ATOM 


2073 


C 


GLU 


B 


323 


5 


. 796 


2 


. 696 


34 


. 810 


1 


.00 


22 


. 35 


41) 


ATOM 


2074 


o 


GLU 


B 


323 


5 


.409 


2 


. 926 


35 


. 951 


1 


.00 


22 


.34 




ATOM 


2075 


N 


PRO 


B 


324 


4 


. 986 


2 


. 165 


33 


. 880 


1 


.00 


19 


. 10 




ATOM 


2076 


CD 


PRO 


B 


324 


5 


. 286 


1 


. 806 


32 


. 483 


1 


.00 


19 


. 11 




ATOM 


2077 


CA 


PRO 


B 


324. 


3 


. 607 


1 


. 839 


34 


. 242 


1 


.00 


22 


. 04 




ATOM 


2078 


CB 


PRO 


B 


324 


2 


. 919 


1 


. 658 


32 


. 893 


1 


.00 


21 


. 96 




ATOM 


2079 


CG 


PRO 


B 


324 


4 


. 015 


1 


. 137 


32 


. 015 


1 


.00 


24 


. 13 




ATOM 


2080 


C 


PRO 


B 


324 


3 


. 619 


0 


. 556 


35 


. 060 


1 


.00 


23 


.44 




ATOM 


2081 


o 


PRO 


B 


324 


' 4 


. 590 


-0 


.200 


35 


. 028 


1 


. 00 


22 


.20 




ATOM 


20 82 


N 


PRO 


B 


325 


2 


. 540 


0 


. 287 


35 


. 801 


1 


. 00 


24 


. 88 




TV m^*\R/T 

ATOM 


2 0 83 


CD 


PRO 


B 


325 


1 


.299 


1 


.068 


35 


. 945 


1 


. 00 


26 


. 67 


^A 


ATOM 


2084 


CA ^ 


PRO 


B 


325 


2 


. 520 


-0 


. 940 


36 


. 603 


1 


. 00 


25 


. 10 




ArOM 


2 0 85 


CB 


PRO 


B 


3 25 


1 


. 394 


-0 


.691 


37 


. 595 


1 


. 00 


27 


. 09 




ATOM 


2 0 86 


CG 


PRO 


B 


325 


0 


. 448 


0 


. 205 


36 


. 854 


1 


. 00 


26 


. 87 




TV fTl^MVJT 

ATOM 


20 87 


C 


PRO 


B 


325 


2 


. 270 


-2 


. 192 


35 


. 776 


1 


. 00 


25 


. 77 




7v nn/*\iv^r 

ATOM 


20 88 


O 


PRO 


B 


325 


1 


. 853 


-2 


. 118 


34 


- 617 


1 


. 00 


21 


. 69 


55 


TV m/""vmr 
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22 .203 
21.913 

22 . 650 
22 . 119 

23 ,237 
23 .201 
23 .504 

23 . 982 
23 .256 
23 . 125 

24 . 125 
21.899 
21.670 
20 . 203 
19.878 
19 . 993 
22 . 017 
22 . 574 
21.686 
21.969 
21.346 
19 . 878 
19 .295 
19 . 763 
23 .477 
23 . 984 
24 . 191 
25 .64 0 
26 . 207 
27 .725 
25.857 
26 .538 
26 .275 
27.200 
25 .755 
26 .251 
25 .450 

25 . 818 
25 .245 

26 .869 
26 .927 

25 . 953 

26 . 123 
27.071 
24 . 949 
24 . 715 
23 . 298 
22 . 158 
20 . 827 
22 . 143 
25 . 718 
26 . 241 



PCT/US99/06899 

7.722*24.640 1.00 25.55 
8.896 22.722 1.00 21.89 
8.176 22.141 1.00 22.67 
8.662 20.695 1.00 23.34 
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7.520 23.381 1.00 27.01 
9.703 23.404 1.00 22.88 
10.058 24.276 1.00 24.93 
11.546 24.652 1.00 28.69 
12.464 23.485 1.00 29.19 
11.984 22.437 1.00 29.63 
13.68123.627 1.00 32.02 
9.249 25.567 1.00 24.40 
8.780 26.103 1.00 25.60 
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9.914 27.830 1.00^ 24.28 
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6.522 24.051 1.00 24.54 
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5.463 29.292 1.00 26.85 
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3.127 29.788 1.00 27.07 
3.909 30.445 1.00 31.71 
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26 .41 

23 . 53 

24 . 60 

26 .'46 

29 .45 

25 .43 

25 .80 

23 . 59 

27 . 74 

26 .32 
26 . 75 

28 .12 

26 .36 

27 . 00 
26 . 97 

24 .25 

30 .37 
32 .34 

36 .61 
42 .73 
47 .39 
46 .96 
31.27 

30 .53 

29 .02 

31 .21 

30 . 98 

32 . 14 

33 .78 
32 .27 

37 . 73 
35 . 83 
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B 


ATOM 


3811 


C 


HIS 
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3860 
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3861 
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ATOM 
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-2.796 1.968 13.269 1.00 34.54 

-2.698 2.553 14.486 1.00 30.23 

-1.502 1.755 12.840 1.00 34.23 

-0.659 2.193 13.760 1.00 36.72 

-1.360 2.681 14.768 1.00 31.48 

-5.649 2.286 10.735 1.00 43.69 

-5.178 2.152 9.606 1.00 46.04 

-6.919 2.020 11.019 1.00 48.35 

-7.843 1.551 9.993 1.00 54.74 

-8.522 0.267 10.452 1.00 54.66 

-8.886 2.. 619 9.681 1.00 59.94 

-8.580 3.812 9.672 1.00 62.81 

-10.116 2.186 9.422 1.00 64.81 

-11.204 3.109 9.112 1.00 67.59 

-12.478 2.327 8.799 1.00 68.06 

-11.449 4.069 10.275 1.00 69.12 

-11.451 5.297 10.036 1.00 68.96 

-11.634 3.579 11.412 1.00 70*70 

-4.547 -6.077 22.000 1.00 18.55 

-3.163 -6.365 21.467 1.00 17.72 

-2.897 -7.853 21.381 1.00 21.17 

-3.719 -8.551 20.374 1.00 22.05 

-3.405 -8.48118.998 1.00 21.32 

-4.239 -9.095 18.063 1.00 21.61 

-5.388 -9.771 18.509 1.00 24.89 

-6.244 -10.339 17.600 1.00 24.94 

-5.718 -9.858 19.860 1.00 24.08 

-4.877 -9.240 20.791 1.00 24.67 

-1.998 -8.460 22.190 1.00 16.67 

-1.330 -7.834 23.325 1.00 15.39 

-2.054 -7.642 24.522 1.00 17.62 

-1.433 -7.072 25.634 1.00 16.16 

-0.077 -6.685 25.542 1.00 20.04 

0.509 -6.113 26.655 1.00 15.55 

0.669 -6.866 24.353 1.00 18.94 

0.035 -7.440 23.241 1.00 15.20 

-1.642 -9.903 21.942 1.00 17.61 

-0.440 -10.009 20.998 1.00 11.63 

-4.997 -22.994 25.273 1.00 55.80 

-4.789 -24.187 25.003 1.00 55.56 

-4.798 -22.559 26.552 1.00 56.04 

-5.468 -21.960 24.264 1.00 57.04 

-15.278 -5.124 10.243 1.00 87.39 

-15.852 -5.086 9.064 1.00 87.68 

-15.832 -4.29111.201 1.00 86.22 

-14.207 -5.886 10.628 1.00 87.65 

9.818 -20.030 -2.211 1.00 63.34 

10.133 -20.267 -4.689 1.00 63.49 

11.204 -20.840 -4.472 1.00 63.87 

7.944 -19.563 -3.758 1.00 65.42 

9.424 -19.484 -3.586 1.00 64.86 

9.533 -20.281 -5.875 1.00 62.00 

10.101 -20.999 -7.009 1.00 60.81 

8.980 -21.540 -7.901 1.00 61.76 

11.050 -20.127 -7.827 1.00 57.47 

12.253 -20.379 -7.858 1.00 57.64 

10.511 -19.103 -8.482 1.00 55.74 
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19. 
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19 . 
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3919 


CD 
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c 
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18 . 
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-18 .212 
-17 . 057 
-16 .286 
-17 . 603 
-16.550- 
-17 . 611 
-17.550 
-17 . 162 
-16.570 
-16 . 058 
-15 . 501 
-14.324 
-15 . 078 
-17.615 
-17 .340 
-18 . 818 
-19 . 946 
-21.203 
-22 . 195 
-23 .227 
-22 .202 
-23 . 195 
-23 .833 
-20. 162 
-20 . 153 
-20.350 
-20 .586 
-20 . 832- 
-20. 826- 
-21 . 048- 
-19 . 878- 
-19.260- 
-19.706- 
-18 .197- 
-19.434 
-19 . 644 
-18.217 
-17.039 
-15.784 
-15.245- 
-16 .389- 
-14.170 
-16.904 
-16.426 
-17 .329 
-17.272 
-17.415 
-16 . 128 
-16 .474 
-15.307 
-18 .417 
-18 .320 
-19.498 
-20 . 685 
-21.853 
-23 . 010 
-24.053 



-9.306 
-9.889 
-10 . 902 
-10.551 
-11 . 258 
-8.560 
-9 . 097 
-7.329 
-6 . 534 
-5 . 199 
-4 .206 
-4.831 
-2 . 926 
-6 . 282 
-6 .393 
-5 . 939 
-5 . 675 
-5 . 460 
-4 .526 
-4 . 751 
-3 . 173 
-2 . 605 
-3 . 540 
-6 . 861 
-6 . 717 
-8 . 032 
-9 . 271 
10 . 410 
11 . 797 
12 . 874 
13 .732 
13 . 951 
13 . 378 
14 . 746 
-9 . 639 
-9 . 956 
-9 . 607 
-9 . 945 
-9 . 881 
11 .239 
12 . 079 
11.037 
-9 . 006 
402 
761 
775 



1.00 



-9 

-7 , 
-6 , 

-5 .362 
-4 .643 
-3 .371 



317 
058 
776 
619 
959 
8 .304 
7 .311 
7 .490 



1 , 
1. 
1, 
1, 
1. 
1, 
1. 
1, 
1 . 
1. 
1. 
1. 
1. 
1. 
1 . 
1. 
1. 
1 . 



00 
00 
00 
00 
00 
00 
00 
00 
00 
00 
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00 
00 
00 
00 
00 
00 
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1.00 



1 
1 
1 
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1 , 
1 , 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
1 . 
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00 
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00 
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00 
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1 . 
1 . 
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1 . 
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1 . 
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00 
00 
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1.00 
1.00 
1 . 00 



53 . 09 
53 .83 
54.55 

52 . 90 
50.45 
50 . 82 
51.28 
48 . 01 

47 .33 
42 .51 
40 . 67 
39.95 
38 . 77 

48 . 87 
46 .71 
51.36 

53 .35 
58 . 64 

62 .94 
64 . 95 
65.49 
68 . 18 
68 . 77 
52 .19 
47.53 
52.37 

52 .90 

54 . 04 
57.52 
58 . 25 
61.43 
64 .32 

63 .22 
65.93 

53 . 09 
53 .06 
50 . 73 
49 .53 
49 .56 
49 . 78 
50.30 
48 . 79 
48 .38 
46.35 
46 . 74 
49.41 
45.20 
40 .19 
41.65 
35 . 91 
54.15 
53 .55 
57.44 
62.46 
61.95 
61.49 
62 . 78 
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5 ATOM 1469 N LYS 492 2.752 19.268 26.783 1.00 46.29 

ATOM 1470 CA LYS 492 1.711 18.351 27.222 1.00 49.90 

ATOM 1471 CB LYS 492 1.772 17.053 26.411 1.00 50.03 

ATOM 1472 CG LYS 492 1.087 17.135 25.062 1.00 53.81 

ATOM 1473 CD LYS 492 -0.002 16.084 24.930 1.00 59.00 

10 ATOM 1474 CE LYS 492 -0.988 16.453 23.827 1.00 61.85 

ATOM 1475 NZ LYS 492 -1.351 15.281 22.976 1.00 62.89 

ATOM 1476 C LYS 492 1.841 18.025 28.701 1.00 51.15 

ATOM 1477 O LYS 492 0.845 17.784 29.379 1.00 53.37 

ATOM 1478 N ALA 493 3.072 18.012 29.199 1.00 50.15 

15 ATOM 1479 CA ALA 493 3.321 17.706 30.600 1.00 49.17 

ATOM 1480 CB ALA 493 4.777 17.314 30.794 1.00 50.39 

ATOM 1481 C ALA 493 2.971 18.885 31.501 1.00 49.36 

ATOM 1487. O ALA 493 3.089 18.799 32.723 1.00 51.57 

ATOM 1483 N GLY 494 2.554 19.989 30.893 1.00 48.61 

20 ATOM 1484 CA GLY 494 2.185 21.159 31.671 1.00 46.92 



1485 C GLY 494 3.322 22.107 32.006 1.00 45.46 

1486 O GLY 494 3.206 22.921 32.919 1.00 43.58 



ATOM 
ATOM 

ATOM 1487 N LEU 495 4.431 22.009 31.284 1.00 44.81 

ATOM 1488 CA LEU 495 5.555 22.899 31.540 1.00 42.34 

25 ATOM 1489 CB LEU 495 6.847 22.293 30.988 1.00 43.79 

ATOM 1490 CG LEU 495 7.712 21.459 31.936 1.00 40.99 

ATOM 1491 CD1 LEU 495 7.022 20.156 32.260 1.00 44.70 

ATOM 1492 CD2 LEU 495 9.072 21.189 31.270 1.00 42.12 

ATOM 1493 C LEU 495 5.278 24.227 30.847 1.00 42.13 

30 ATOM 1494 O LEU 495 4.664 24.258 29.778 1.00 42.49 

ATOM 1495 N THR 496 5.718 25.324 31.452 1.00 42.73 

ATOM 1496 CA THR 496 5.521 26.636 30.845 1.00 43.56 

ATOM 1497 CB THR 496 5.841 27.767 31.829 1.00 46.09 

ATOM 1498 OG1 THR 496 7.222 27.688 32.208 1.00 43.92 

35 ATOM 1499 CG2 THR 496 4.965 27.662 33.064 1.00 45.63 

ATOM 1500 C THR 496 6.471 26.764 29.660 1.00 45.54 

ATOM 1501 O THR 496 7.370 25.939 29.488 1.00 43.39 

ATOM 1502 N LEU 497 6.280 27.800 28.849 1.00 45.02 

ATOM 1503 CA LEU 497 7.135 28.020 27.688 1.00 45.12 

40 ATOM 1504 CB LEU 497 6.710 29.286 26.944 1.00 46.62 

ATOM 1505 -CG LEU 497 5.933 29.080 25.640 1.00 50.20 

ATOM 1506 CD1 LEU 497 5.886 30.397 24.875 1.00 50.95 

ATOM 1507 CD2 LEU 497 6.589 27.990 24.798 1.00 50.91 

ATOM 1508 C LEU 497 8.599 28.135 28.101 1.00 44.94 

45 ATOM 1509 O LEU 497 9.474 27.516 27.493 1.00 45.03 

ATOM 1510 N GLN 498 8.862 28.927 29.137 1.00 41.14 

ATOM 1511 CA GLN 498 10.221 29.101 29.627 1.00 40.54 

ATOM 1512 CB GLN 498 10.246 30.140 30.743 1.00 43.82 

ATOM 1513 CG GLN 498 11.585 30.270 31.437 1.00 43.37 

50 ATOM 1514 CD GLN 498 11.539 31.260 32.584 1.00 47.03 

ATOM 1515 OE1 GLN 498 10.565 31.308 33.332 1.00 49.18 

ATOM 1516 NE2 GLN 498 12.591 32.054 32.727 1.00 45.30 

ATOM 1517 C GLN 498 10.777 27.773 30.145 1.00 39.39 

ATOM 1518 O GLN 498 11.923 27.422 29.866 1.00 35.05 

55 ATOM 1519 N GLN 499 9.965 27.040 30.902 1.00 36.49 

ATOM 1520 CA GLN 499 10.391 25.748 31.434 1.00 36.91 

ATOM 1521 CB GLN 499 9.314 25.155 32.344 1.00 38.84 

ATOM 1522 CG GLN 499 9.155 25.825 33.703 1.00 41.33 

ATOM 1523 CD GLN 499 8.039 25.187 34.512 1.00 42.74 

60 ATOM 1524 OE1 GLN 499 7.027 24.760 33.955 1.00 45.44 

ATOM 1525 NE2 GLN .499 8. 222„__ 25 . 107 35.829 1.00 43.48 
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23 .118 
22.850 
24 . 159 

23 .922 

24 . 881 
22 . 768 
22 . 116 
22 . 681 
20 . 853 
20 . 069 
19 .213 
19 . 193 
18 .804 
18 . 895 
18 . 042 
18.431 
18.395 - 
17.477 - 
19.395 - 
19.095 - 
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-1.299 
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-2 .415 
-2 .585 
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ATOM 5 Hll =47 16.362 -7.796 2.720 X . 00 54.30 

E ill N c S S! S:S S:S? i:°oo I : 

10 E 55S 3 S K S:SS : 7 .:E - • 

ii ss s x a: 3:ss - 

ATOM W34 CZ ARG 548 14.864 -3.178 -0.429 1.00 79.04 

IS JtoM 1935 NH1 ARG 548 15.891 -3.644 -1.128 1.00 80.66 

15 StOM 1936 Zl ARG 548 15.001 -2.063 0.278 1.00 80.39 

ATOM 1937 C ARG 548 13.695 -9.608 0.711 1.00 61.65 

ATOM ^938 O ARG 548 12.500 -9.781 0.466 1.00 62.05 

ATOM 1939 N LEU 549 14.576 -10.603 0.756 1.00 62.39 

00 ATOM ^940 CA LEU 549 14.188 -11.985 0.507 1.00 64.02 

ATOM Htl CB LEU 549 15.433 -12.828 0.195 1.00 62.14 

ATOM 1942 CG LEU 549 16.461 -12.191 -0.753 1.00 60.76 

ATOM lltl CD1 LEU 549 17.699 -13.074 -0.878 1.00 57.77 

ATOM lltl CD 2 LEU 549 15.823 -11.972 -2.108 1.00 58.38 

25 ATOM lltt C LEU 549 13.431 -12.574 1.702 1.00 66.65 

ATOM 1946 O LEU 549 12.759 -13.600 1-577 1.00 67.15 

ATOM lltl » HIS 550 13.541 -11.920 2.856 1.00 67.72 



ATOM 1948 CA HIS 550 12.858 -12.378 4.065 1.00 69.93 

ATOM 1949 CB HIS 550 13.753 -12.190 5.298 1.00. 70.76 

30 ATOM 1950 CG HIS 550 14.977 -13.054 5.306 1.00 71.50 

ATOM 1951 CD 2 HIS 550 15.539 -13.821 4.341 1.00 71.63 

ATOM 1952 ND1 HIS 550 15.793 -13.172 6.411 1.00 71.98 

ATOM llll CE1 HIS 550 16.805 -13.972 6.126 1.00 72.04 

ATOM llll NE2 HIS 550 16.674 -14.379 4.876 1.00 71.39 

35 ATOM 1955 C HIS 550 11-556 -11.603 4.275 1.00 71.15 

ATOM 1956 O HIS 550 10.940 -11.684 5.340 1.00 70.66 

ATOM ill" N ALA 551 11.143 -10.851 3.258 1.00 72.22 

ATOM 1958 CA ALA 551 9.919 -10.057 3.338 1.00 73.58 

ATOM 1959 CB ALA 551 9 904 - .014 2 221 X 0 73 21 

A(\ atom "I96 0 C ALA 551 o .ddo -±u.r^u J • ^ ^ 

ATOM llll O ALA 551 7.684 -10.474 2.621 1.00 76.12 

ATOM 1962 OXT ALA 551' 8.651 -12.025 3.852 1.00 73.79 

HETATM 1963 CIO OHT 600 30.581 1.481 29.471 1.00 26.84 

SSS™ ^964 C9 OHT 600 30.713 -0.043 29.358 1.00 22.85 

45 S 1965 C8 OHT 600 31.366 -0.385 28.037 1.00 25.56 

45 hSaTM 1966 Cll OHT 600 32.761 0.051 27.916 1.00 27.51 

HETATM 1967 C16 OHT 600 33.218 0.797 26.797 1.00 28.35 

SIS™ 1968 CIS OHT 600 34.551 1.237 26.747 1.00 30.39 

SIS™ llsl C14 OHT 600 35.443 0.923 . 27,792 1-00 30.23 

SO SSS™ 1970 C13 OHT 600 35.004 0.185 28.890 1.00 31.45 

SIS™ 197? Cll OHT 600 33.666 -0.241 28.955 1.00 27.93 

SIS™ llll CI OHT 600 30.682 -1.089 27.077 1.00 24.41 

SIS™ 1973 CI OHT 600 29.211 -1.258 27.052 1.00 24.26 

SIS™ 1974 C2 OHT 600 28.644 -2.526 26.706 1.00 25.92 

55 HETATM 1975 C3 OHT 600 27.254 -2.668 26.580 1.00 26.32 

HETATM 1976 C4 OHT 600 26.438 -1.553 26.813 1.00 29.02 

SIS™ 1977 04 OHT 600 25.072 -1.605 26.716 1.00 28.42 

• HETATM 1978 C5 OHT 600 26.980 -0,286 27.130 1.00 26.98 

HETATM 1979 C6 OHT 600 28.362 -0.147 27.231 1.00 25.23 

60 HETATM 1980 C17 OHT 600 31.370 -1.692 25.942 1.00 26.61 

HETATM 1981 CIS OHT 600 32.508 -2.498 26.151 1.00 26.77 
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HOH 48 27.868 -10.898 28.271 1.00 31.27 

HOH 49 6.955 13.568 28.233 1.00 48.83 

HOH 50 22.051 -15.030 28.603 1.00 36.91 

HOH 51 7.026 31.002 30.284 1.00 46.73 

HOH 52 -1.489 12.385 15.164 1.00 51.17 

HOH 53 3.499 6.444 14.452 1.00 50.38 

HOH 54 18.655 -2.048 25.518 1.00 52.29 

HOH 55 28.188 -15.195 38.996 1.00 55.22 

HOH 56 35.275 -10.556 38.061 1.00 57.39 

HOH 57 37.771 -9.103 34.605 1.00 54.17 

HOH 58 31.403 -3.039 17.983 1.00 46.80 

HOH 59 30.455 -6.352 17.005 1.00 47.05 

HOH 60 25.985 8.255 0.416 1.00 43.32 

HOH 61 35.679 0.749 10.462 1.00 42.99 

HOH 62 14.741 4.029 33.936 1.00 49.59 

HOH 63 16.333 2.592 35.952 1.00 45.13 

HOH 64 23.809 7.186 39.798 1.00 45.36 

HOH 65 27.012 -1.948 46.995 1.00 63.39 

HOH 66 25.956 -6.422 42.144 1.00 44.94 

HOH 67 23.510 -8.414 39.036 1.00 39.06 
HOH 
HOH 

HOH 70 30.111 14.823 12.793 1.00 44.58 

HOH 71 26.850 -6.092 1.594 1.00 40.15 

HOH 72 20.448 -3.169 1.055 1.00 42.50 

HOH 73 33.896 3.047 16.172 1.00 46.39 

HOH 74 16.884 0.446 26.043 1.00 61.50 

HOH 75 18.595 0.296 27.866 1.00 47.33 

HOH 76 6.166 21.439 19.124 1.00 47.94 

HOH 77 18.484 20.060 16.232 1.00 35.52 

HOH 78 1.985 23.265 29.187 1.00 46.42 

HOH 79 12.729 30.461 27.530 1.00 62.79 



68 41.475 0.971 33.110 1.00 55.50 

69 36.519 8.863 38.836 1.00 41.56 
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5 WHAT IS CLAIMED IS : 

1 . A method of identifying a compound that modulates coactivator binding to a nuclear 
receptor, said method comprising: 

modeling test compounds that fit spacially into a nuclear receptor coactivator 
1 0 binding site of interest using an atomic structural model of a nuclear receptor coactivator binding 
site or portion thereof, 

screening said test compounds in an assay characterized by binding of a test 
compound to a nuclear receptor coactivator binding site, and 

identifying a test compound that modulates coactivator binding to said nuclear 

1 5 receptor. 

2. The method of claim 1, wherein said atomic structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human thyroid receptor selected 
from the group consisting of Val284, Phe293, Ue302, Leu305, and Leu454. 



20 



3. The method of claim 1, wherein said atomic structural model comprises atomic 
coordinates of amino acid residues corresponding to residues of human thyroid receptor selected 
from the group consisting of Val284, Lys288, Ile302, Lys306, Leu454 and Glu457. 



25 4. The method of claim 1, wherein said atomic structural model comprises atomic 

coordinates of amino acid residues corresponding to residues of human thyroid receptor helix 3 
residues Ile280, Thr281, Val283, Val284, Ala287, and Lys2S8, helix 4 residue Phe293, helix 5 
residues Gln301, Ile302, Leu305, Lys306, helix 6 residue Cys309, and helix 12 residues Pro453, 
Leu454, Glu457, Val458 and Phe459. 

30 

5. The method of claim 1, wherein said nuclear receptor coactivator binding site 
comprises amino acid residues corresponding to residues of human thyroid receptor selected from 
the group consisting of helix 3 residues Ile280, Thr2Sl, Val283, Val284, Ala287, and Lys288, helix 
4 residue Phe293, helix 5 residues Gln301, Ile302, Leu305, Lys306, helix 6 residue Cys309, and 
35 helix 12 residues Pro453, Leu454, Glu457, Val458 and Phe459. 



6. The method of claim 5, wherein said amino acid residues corresponding to residues 
of human thyroid receptor comprise Val284, Phe293, Ile302, Leu305 5 and Leu454. 
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7. The method of claim 5, wherein said amino acid residues corresponding to residues 
of human thyroid receptor comprise Val2S4, Lys288, Ile302, Lys306> Leu454 and Glu457. 

8. The method of claim 1, wherein said nuclear receptor coactivator binding site 
10 comprises amino acid residues corresponding to residues of human thyroid receptor of helix 3 

residues Ile2S0, Thr281, Val283, Val284, Ala287, and Lys288, helix 4 residue Phe293, helix 5 
residues Gln301, Ile302, Leu305, Lys306, helix 6 residue Cys309, and helix 12 residues Pro453, 
Leu454, Glu457, Val458 and Phe459. 

15 9. The method of any one of claims 5 through 8, wherein said nuclear receptor is 

selected from the group consisting of TR, RAR, RXR, PPAR, VDR, ER, GR, PR, MR, and AR. 

10. The method of claim 1, wherein said screening is in vitro. 

20 11. The method of claim 10, wherein said screening is high throughput screening. 

12. The method of claim 1, wherein said assay is a biological assay. 

13. The method of claim 1, wherein said test compound is from a library of compounds. 

25 

14. The method of claim 1, wherein said test compound is an agonist or antagonist of 
coactivator binding. 

15. The method of claim 14, wherein said test compound is a small organic molecule, a 
30 peptide, or peptidomimetic. 

16. The method of claim 15, wherein said compound is a peptide comprising a NR-box 
amino acid sequence, or derivative thereof. 

35 17. A method for identifying an agonist or antagonist of coactivator binding to a nuclear 

receptor, said method comprising the steps of: 

providing the atomic coordinates of a nuclear receptor coactivator binding site or 
portion thereof to a computerized modeling system; 
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5 modeling compounds which fit spacially into the nuclear receptor coactivator 

binding site; and 

identifying in an assay for nuclear receptor activity a compound that increases or 
decreases the activity of said nuclear receptor by binding the coactivator binding site of said nuclear 
receptor, whereby an agonist or antagonist of coactivator binding is identified. 

10 

18. A machine-readable data storage medium, comprising a data storage material 
encoded with machine readable data which, when using a machine programmed with instructions 
for using said data, is capable of displaying a graphical three-dimensional representation of a 
molecular complex of a compound bound to a nuclear receptor coactivator binding site comprising 

15 structure coordinates of amino acids corresponding to human thyroid receptor amino acids selected 
from the group consisting of helix 3 residues Ile280, Thr281, Val283, Val284, Ala287, and Lys288, 
helix 4 residue Phe293, helix 5 residues Gln301, Ile302, Leu305, Lys306, helix 6 residue Cys309, 
and helix 12 residues Pro453, Leu454, Glu457, Val458 and Phe459, or a homologue of said 
molecular complex, wherein said homologue comprises a coactivator binding site that has a root 

20 mean square deviation from the backbone atoms of said amino acids of not more than 1 .5 A. 

19. The machine readable storage medium of claim 1 8, wherein said nuclear receptor is 
a thyroid receptor. 

25 20. The machine readable storage medium of claim 19, wherein said thyroid receptor is 

human. 

21. The machine readable storage medium of claim 20, wherein said molecule is peptide 

30 22. The machine readable storage medium of claim 21, wherein said peptide comprises a 

NR-box amino acid sequence, or derivative thereof. 

23. The machine-readable data storage medium according to claim 18, wherein saic 
molecular complex is defined by the set of structure coordinates depicted in Appendix 1, or i 
35 homologue of said molecular complex, said homologue having a root mean square deviation firon 
the backbone atoms of said amino acids of not more than 1 .5 A. 



BNSDOCID: <WO 996001 4A3_IA> 



244 

SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



5 24. A machine-readable data storage medium comprising a data storage materia 

encoded with a first set of machine readable data which, when combined with a second set o 
machine readable data, using a machine programmed with instructions for using said first set of dafc 
and said second set of data, can determine at least a portion of the structure coordinate; 
corresponding to the second set of machine readable data, wherein: said first set of data comprises ; 
10 Fourier transform of at least a portion of the structural coordinates selected from the grouj 
consisting of coordinates depicted in Appendix 1; and said second set of data comprises an X-raj 
diffraction pattern of a molecule or molecular complex. 



15 



25 . A cocrystal of a nuclear receptor comprising a molecule bound to the coactivator 
binding site of said nuclear receptor. 



26. 



The cocrystal of claim 25, wherein said nuclear receptor is a thyroid receptor. 



27. 



The cocrystal of claim 26, wherein said thyroid receptor is human. 



20 



28. 



The cocrystal of claim 27, wherein said molecule is peptide. 



29. The cocrystal of claim 28, wherein said peptide comprises a NR-box amino acid 
sequence or derivative thereof. 



25 



30. A compound identified according to the method of claim 1 . 
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FIGURE 2 
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AEGHSRLHD S KGQTKLLQLLTTKS EQMEP S PLAS 
ADGQSRLHDSKGQTKLLQLLTTKSEQMEPSPLAS 
AEGHSRLHDSKGQTKLLQLLTTKSEQMEPSPLPS 
AENQRGPLESKGHKKLLQLLTCSSEDRGHSSLTN 
AENQRGPLESKGHKKLLQLLTCSSDDRGHSSLTN 
SETPRGPLESKGHKKLLQLLTCSSEDRGHSSLTN 
SEGDSKY- -SQTSHKLVQLLTTTAEQQLRHADID 
. - S KLhQLLT, . 

PGSTHGTSLKEKHKILHRLLQDSSSPVDLAKLTA 
SGSTHGTSLKEKHKIIJIRLLQDSSSPVDLAKLTA 
PGSTHGTSLKEKHKILHRLLQDSSSPVDLAKLTA 
TSNMHGSLLQEKHRILHKLLQNGNSPAEVAKITA 
T SNMHGS LLQEKHRI LHKLLQNGNS P AEVAKI TA 
TSNVHGSLLQEKHRILHKLLQNGNSPAEVAKITA 
TCPSSHSSLTERHKILHRLLQE • GSPSDITTLSV 
L.E+H+ILH+LLQ. . .SP.-h. .h. . 

EPASPKKKE NALLRYLLDKDDTKDIGLPEIT 

EPVSPKKKE NAIiLRYLLDKDDTKDIGLPEIT 

EPASPKKKE NALLRYLLDKDDTKDIGLPSIT 

EQLSPKKKE- -NNALLRYLLDRDDPSDVLAKKLQ 
EQLSPKKKE - -NNALLRYLLDRDDPSDALSKELQ 
EQLSPKKKE - -NNALLRYLLDRDDPSDALSKELQ 
ELDAAKKKESKDHQLLRYLLDKDEKDLRSTPNLC 
E • • . . KKKE LLRYLLD+D- h. 
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FIGURE 10c 



120 r 



100 - 



Q 

m 

C9 
QC 

I- 

•a 
c 

3 

o 

CD 




.1 



1 10 100 

Peptide (pM) 



1000 



A 


Residues 8027 of SEQ ID No: 7 


PKKKENA 


[llryll 


DKDDTKD 




Residues 1 1-27 of SEQ ID No: 6 


TSLKEKHK 


I LLHRLL I 


QDSS 


!A 


SEQ ID No: 29 


TSLKEKHK 


LLRYLL 


QDSS « 



BNSDOCID: <WO 996001 4A3.IA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



16/26 



PCT/US99/06899 



FIGURE 11a 
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<210> 1 

<211> 5 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Leu Xaa Xaa Leu Leu 
1 5 



<210> 2 
<211> 6 
<212> PRT 

<213 > Homo sapiens 
<400> 2 

lie Leu Xaa Xaa Leu Leu 
1 5 



<210> 3 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Phe Xaa Xaa Leu Trp 
1 5 



<210> 4 
<211> 5 
<212> PRT 

<213> Homo sapiens 
<400> 4 
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Phe Xaa Xaa Ala Leu 
1 5 



<210> 5 

10 <211> 34 

<212> PRT 

<213> Homo sapiens 



<400> 5 

15 Ala Glu Gly His Ser Arg Leu His Asp Ser Lys Gly Gin Thr Lys Leu 
1 5 .10 15 



20 



Leu Gin Leu Leu Thr Thr Lys Ser Glu Gin Met Glu Pro Ser Pro Leu 
20 25 30 

Ala Ser 



25 <210> 6 
<211> 34 
<212> PRT 

<213> Homo sapiens 

30 <220> 

<221> MUTAGEN 

<222> (15) 

<223> lie --> Ala 

35 <220> 

<221> MUTAGEN 

<222> (16) 

<223> Leu --> Ala 

40 <220> 

<2 21> MUTAGEN 

<222> (19) 

<22 3> Leu --> Ala 

45 <220> 

<221> MUTAGEN 

<222> (20) 

<22 3> Leu — > Ala 

50 <220> 

<221> MUTAGEN 
<222> (16) . . (20) 

<223> Leu(16) -~> Ala; Leu(20) - - > Ala 

55 <220> 

<221> MUTAGEN 

<222> (15) . . (16) 

<223> LeuLeu AlaAla 

60 <220>- 

<221> MUTAGEN 

<222> (17) . . (18) 

<223> HisArg --> AlaAla 

65 <220> 

<221> MUTAGEN 
<222> (19) . . (20) 

2/15 
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5 <223> LeuLeu - - > AlaAla 
<220> 

<221> MUTAGEN 
<222> (15) 
10 <223> lie --> Phe 

<220> 

<221> MUTAGEN 
<222> (16) 
15 <223> leu --> Phe 

<220> 

<221> MUTAGEN 
<222> (19) 
20 <223> Leu Phe 

<220> 

<221> MUTAGEN 

<222> (20) 

25 <223> Leu Phe 

<400> 6 

Pro Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys lie Leu 
15 10 15 



30 



35 



His Arg Leu Leu Gin Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu 
20 25 30 

Thr Ala 



<210> 7 
<211> 31 
40 <212> PRT 

<213> Homo sapiens 

<400> 7 

Glu Pro Ala Ser Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu 
45 1 5 10 15 

Leu Asp Lys Asp Asp Thr Lys Asp lie Gly Leu Pro Glu lie Thr 
20 25 30 

50 - 

<210> 8 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 8 

Ala Asp Gly Gin Ser Arg Leu His Asp Ser Lys Gly- Gin Thr Lys Leu 
15 10 15 

60 Leu Gin Leu Leu Thr Thr Lys Ser Glu Gin Met Glu Pro Ser Pro Leu 

20 25 30 



55 



65 



Ala Ser 



<210> 9 
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5 <211>- 34 
<212> PRT 

<213> Homo sapiens 
<400> 9 

10 Ser Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys lie Leu 
1 5 10 15 



15 



His Arg Leu Leu Gin Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu 
20 25 30 

Thr Ala 



20 <210> 10 
<211> 31 
<212> PRT 

<213> Homo sapiens 
25 <400> 10 

Glu Pro Val Ser Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu 
15 10 15 

Leu Asp Lys Asp Asp Thr Lys Asp lie Gly Leu Pro Glu lie Thr 
30 20 25 30 



<210> 11 
<211> 34 
<212> PRT 

<213> Homo sapiens 



40 



<400> 11 

Ala Glu Gly His Ser Arg Leu His Asp Ser Lys Gly Gin Thr Lys Leu 
15 10 15 



45 



Leu Gin Leu Leu Thr Thr Lys Ser Glu Gin Met Glu Pro Ser Pro Leu 
20 25 30 

Pro Ser 



<210> 12 
50 <211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 12 

55 Pro Gly Ser Thr His Gly Thr Ser Leu Lys Glu Lys His Lys lie Leu 
15 10 15 



60 



His Arg Leu Leu Gin Asp Ser Ser Ser Pro Val Asp Leu Ala Lys Leu 
20 25 30 

Thr Ala 



65 



<210> 13 
<211> 31 
<212> PRT 
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10 



15 



20 



25 



<400> 13 

Glu Pro Ala Ser Pro Lys Lys Lys Glu Asn Ala Leu Leu Arg Tyr Leu 

5 10 is 

Leu Asp Lys Asp Asp Thr Lys Asp lie Gly Leu Pro Ser lie Thr 
20 25 30 

<210> 14 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 14 

Ala Glu Asn Gin Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu 
15 10 15 

Leu Gin Leu Leu Thr Cys Ser Ser Glu Asp Arg Gly His Ser Ser Leu 
20 25 30 

Thr Asn 



30 



40 



<210> 15 

<211> 34 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Thr Ser Asn Met His Gly Ser Leu Leu Gin Glu Lys His Arg lie Leu 

5 10 15 

His Lys Leu Leu Gin Asn Gly Asn Ser Pro Ala Glu Val Ala Lys lie 
20 25 3 0 

Thr Ala 



45 



50 



55 



<210> 16 

<211> 32 

<212> PRT 

<213> Homo sapiens 

<400> 15 

Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr 

5 10 is 

Leu Leu Asp Arg Asp Asp Pro Ser Asp Val Leu Ala Lys Lys Leu Gin 
20 25 30 



60 



<210> 17 
<211> 34 
65 <212> PRT 

<213> Homo sapiens 
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5 <400> 17 _ 

Ala Glu Asn Gin Arg Gly Pro Leu Glu Ser Lys Gly Hxs Lys Lys Leu 
± 5 10 15 

Leu Gin Leu Leu Thr Cys Ser Ser Asp Asp Arg Gly His Ser Ser Leu 
10 20 25 30 

Thr Asn 



15 

<210> 18 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<400> 18 

Thr Ser Asn Met His Gly Ser Leu Leu Gin Glu Lys Hxs Arg lie Leu 
1 5 10 15 

25 His Lys Leu Leu Gin Asn Gly Asn Ser Pro Ala Glu Val Ala Lys He 

20 25 30 



20 



30 



40 



Thr Ala 



<210> 19 
<211> 32 
<212> PRT 
35 <213> Homo sapiens 

<400> 19 

Glu Gin Leu Ser Pro Lys Lys Lys Glu Asn Asn Ala Leu Leu Arg Tyr 
. i 5 10 15 



Leu Leu Asp Arg Asp Asp Pro Ser Asp Ala Leu Ser Lys Glu Leu Gin 
20 25 30 



45 



<210> 20 
<211> 34 
50 <212> PRT 

<213> Homo sapiens 

<400> 20 

Ser Glu Thr Pro Arg Gly Pro Leu Glu Ser Lys Gly His Lys Lys Leu 
55 1 5 10 15 

Leu Gin Leu Leu Thr Cys Ser Ser Glu Asp Arg Gly His Ser Ser Leu 
20 25 30 

60 Thr Asn 



<210> 21 
65 <211> 34 
<212> PRT 

<213> Homo sapiens 

6/15 



BNSDOCID: <WO 996001 4A3JA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



<400> 21 

Thr Ser Asn Val His Gly Ser Leu 
1 5 

His Lys Leu Leu Gin Asn Gly Asn 
20 

Thr Ala 



Leu Gin Glu Lys His Arg lie Leu 
10 15 

Ser Pro Ala Glu Val Ala Lys lie 
25 30 



<210> 22 
<211> 32 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Glu Gin Leu Ser Pro Lys Lys Lys 
1 5 

Leu Leu Asp Arg Asp Asp Pro Ser 
20 



Glu Asn Asn Ala Leu Leu Arg Tyr 
10 15 

Asp Ala Leu Ser Lys Glu Leu Gin 
25 30 



<210> 23 
<211> 32 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Ser Glu Gly Asp Ser Lys Tyr Ser 
1 5 

Leu Leu Thr Thr Thr Ala Glu Gin 
20 



Gin Thr Ser His Lys Leu Val Gin 
10 is 

Gin Leu Arg His Ala Asp lie Asp 
25 30 



<210> 24 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 24 

Thr Cys Pro Ser Ser His Ser Ser 
1 5 

His Arg Leu Leu Gin Glu Gly Ser 
20 

Val 



Leu Thr Glu Arg His Lys lie Leu 
10 15 

Pro Ser Asp lie Thr Thr Leu Ser 
25 30 



<210> 25 
<211> 34 
<212> PRT 
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5 <213> Homo sapiens 

Glu°Leu 5 Asp Ala Ala Lys Lys Lys Glu Ser Lys Asp His Gin Leu Leu 
1 5 10 15 



10 



15 



Arg Tyr Leu Leu Asp Lys Asp Glu Lys Asp Leu Arg Ser Thr Pro Asn 



20 25 

Leu Cys 



<210> 26 
<211> 34 
20 <212> PRT 

<213> Homo sapiens 

<220> 

<221> SITE 
25 <222> (2) 

<223> negatively charged amino acid 

<220> 

<221> SITE 
30 <222> (17) 

<223> hydrophobic amino acid 

<220> 

<221> SITE 
35 <222> (25) 

<223> negatively charged amino acid 

Jaa°xaa 6 Xaa Xaa Xaa Xaa Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Lys Leu 
40 1 5 10 15 

Xaa Gin Leu Leu Thr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

45 Xaa Xaa 



<210> 27 
50 <211> 34 
<212> PRT 

<213> Homo sapiens 

<220> 

55 <221> SITE 

<222> (12) 

<223> positively charged amino acid 

<220> 
60 <221> SITE 
<222> (14) 

<223> positively charged amino acid 

<220> 
65 <221> SITE 
<222> (18) 

<223> positively charged amino acid 
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5 



<220> 

<221> SITE 
<222> (28) 

<223> negatively charged amino acid 
<220> 

<221> SITE 
<222> (29) 

<223> hydrophobic amino acid 
<220> 

<221> SITE 
<222> (32) 

<22 3> hydrophobic amino acid 
<400> 27 



<210> 28 
<211> 34 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> SITE 
<222> (22) 

<223> positively charged amino acid 
<220> 

<221> SITE 
<222> (24) 

<22 3> negatively charged amino acid 
<220> 

<221> SITE 
<222> (33) 

<223> hydrophobic amino acid 
<400> 28 



25 




Xaa Xaa 



30 



55 




Xaa xaa 



60 



65 



<210> 29 
<211> 18 
<212> PRT 



<213> Homo sapiens 



9/15 



BNSDOCID: <WO. 



.99600 14A3JA> 



SUBSTITUTE SHEET (RULE 26) 



WO 99/060014 



PCT/US99/06899 



Th£°Ser 9 Leu Lys Glu Lys His Lys Leu Leu Arg Tyr Leu Leu Gin Asp 
15 10 15 



Ser Ser 

10 



<210> 30 
<211> 33 
15 <212> PRT 

<213> Homo sapiens 



<220> 

<221> MUTAGEN 
20 <222> (5) 

<223> Thr --> Arg (T281R) 

<220> 

<221> MUTAGEN 
25 <222> (8) 

<223> Val --> Arg (V284R) 

<220> 

<221> MUTAGEN 

30 <222> (9) 

<223> Asp Ala (D285A) 

<220> 

<221> MUTAGEN 
35 <222> (12) 

<223> Lys --> Ala (K288A) 

<220> 

<221> MUTAGEN 

40 <222> (22) 

<223> Cys Arg (C298R) 

<220> 

<221> MUTAGEN 
45 <222> (26) 

<223> lie --> Arg (I302R) 



<220> 

<221> MUTAGEN 
50 <222> (30) 

<223> Lys --> Ala (K306A) 



<400> 30 n _ 
Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
55 1 5 10 15 

Phe Cys Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly Cys 
20 25 30 

60 Cys 



<210> 31 
65 <211> 12 
<212> PRT 
<213> Homo sapiens 
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10 



15 



20 



25 



30 



40 



50 



60 



65 



<220> 

<221> MUTAGEN 
<222> (5) 

<223> Leu Arg (L454R) 

<220> 

<221> MUTAGEN 
<222> (7) 

<223> Leu --> Arg (L456R) 
<220> 

<221> MUTAGEN 
<222> (8) 

<223> Glu Lys (E457K) 

<400> 31 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 



<210> 32 
<211> 33 
<212> PRT 

<213> Homo sapiens 
<400> 32 

Thr Pro Ala lie Thr Arg Val Val Asp Phe Ala Lys Lys Leu Pro Met 
15 10 15 

35 Phe Ser Glu Leu Pro Cys Glu Asp Gin lie lie Leu Leu Lys Gly Cys 

20 25 30 



Cys 



<210> 33 
<211> 12 
<212> PRT 
45 <213> Homo sapiens 



<400> 33 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 



<210> 34 

<211> 33 

<212> PRT 

55 <213> Homo sapiens 

<400> 34 

Thr Lys Cys lie lie Lys lie Val Glu Phe Ala Lys Arg Leu Pro Gly 
1 5 10 15 



Phe Thr Gly Leu Ser lie Ala Asp Gin lie Thr Leu Leu Lys Ala Ala 
20 25 30 

Cys 
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5 <210> 35 

<211> 12 

<212> PRT 

<213> Homo sapiens 

10 <400> 35 

Leu Phe Pro Pro Leu Phe Leu Glu Val Phe Glu Asp 
15 10 

15 <210> 36 
<211> 33 
<212> PRT 

<213> Homo sapiens 
20 <400> 36 

Asp Lys Gin Leu Phe Thr Leu Val Glu Trp Ala Lys Arg lie Pro His 
15 10 15 

Phe Ser Glu Leu Pro Leu Asp Asp Gin Val He Leu Leu Lys Ala Gly 
25 20 25 30 

Trp 



30 



35 



40 



45 



55 



65 



<210> 37 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 37 

Pro He Asp Thr Phe Leu Met Glu Met Leu Glu Ala 
15 10 



<210> 38 
<211> 33 
<212> PRT 

<2X3> Homo sapiens 
<400> 38 

Val Glu Ala Val Gin Glu He Thr Glu Tyr Ala Lys Asn lie -Pro Gly 
15 10 15 

50 Phe He Asn Leu Asp Leu Asn Asp Gin Val Thr Leu Leu Lys Tyr Gly 

20 25 30 



Val 



<210> 39 
<211> 12 
<212> PRT 
60 <213> Homo sapiens 

<400> 39 

Ser Leu His Pro Leu Leu Gin Glu He Tyr Lys Asp 
15 10 



<210> 40 
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<211> 33 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Ser Tyr Ser He Gin Lys Val He Gly Phe Ala Lys Met He Pro Gly 



10 



IS 



Phe Arg Asp Leu Thr Ser Glu Asp Gin He Val Leu Leu Lys Ser Ser 
Ala 



15 20 25 30 



20 <210> 41 
<211> 12 
<212> PRT 

<213> Homo sapiens 

25 <400> 41 

Lys Leu Thr Pro Leu Val Leu Glu Val Phe Gly Asn 



5 



10 



30 <210> 42 
<211> 33 
<212> PRT 

<213> Homo sapiens 

35 <220> 

<221> MUTAGEN 
<222> (12) 

<223> Lys --> Ala (K362A) 

40 <220> 

<221> MUTAGEN 
<222> (26) 

<223> Val > Arg (V3 76R) 

45 <400> 42 

Asp Arg Glu Leu Val His Met He Asn Trp Ala Lys Arg Val Pro Gly 

5 10 15 



Phe Val Asp Leu Thr Leu His Asp Gin Val His Leu Leu Glu Cys Ala 
20 25 30 

Trp 



<210> 43 
<211> 12 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> MUTAGEN 
<222> (8) 

<223> Glu Lys (E542K) 

<400> 43 

Pro Leu Tyr Asp Leu Leu Leu Glu Met Leu Asp Ala 
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10 



<210> 44 
<211> 33 
10 <212> PRT 

<213> Homo sapiens 

<400> 44 

Gly Arg Gin Val lie Ala Ala Val Lys Trp Ala Lys Ala He Pro Gly 
15 1 5 10 

Phe Arg Asn Leu His Leu Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser 
20 25 30 

20 Trp 



<210> 45 
25 <211> 12 
<212> PRT 

<213> Homo sapiens 
<400> 45 

30 Glu Phe Pro Glu Met Leu Ala Glu He He Thr Asn 



<210> 46 
35 <211> 33 
<212> PRT 

<213> Homo sapiens 

<400> 46 ^ _ 

40 Glu Arg Gin Leu Leu Ser Val Val Lys Trp Ser Lys Ser Leu Pro Gly 
x 5 10 15 

Phe Arg Asn Leu His He Asp Asp Gin He Thr Leu He Gin Tyr Ser 
20 25 30 



45 



Trp 



50 <210> 47 
<211> 12 
<212> PRT 

<213> Homo sapiens 

55 <400> 47 

Glu Phe Pro Glu Met Met Ser Glu Val He Ala Ala 
1 5 10 



60 <210> 48 
<211> 33 
<212> PRT 

<213> Homo sapiens 



65 <400> 48 

Gly Lys Gin Met He Gin Val Val Lys Trp Ala Lys Val Leu Pro Gly 
15 10 15 
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Phe Lys Asn Leu Pro Leu Glu Asp Gin lie Thr Leu He Gin Tyr Ser 
20 25 30 

Trp 



<210> 49 
<211> 12 
15 <212> PRT 

<213> Homo sapiens 

<400> 49 

Glu Phe Pro Ala Met Leu Val Glu He He Ser Asp 



20 i 



5 10 



<210> 50 
<211> 33 
25 <212> PRT 

<213> Homo sapiens 

<400> 50 

Glu Arg Gin Leu Val His Val Val Lys Trp Ala Lys Ala Leu Pro Gly 
1 5 10 15 

Phe Arg Asn Leu His Val Asp Asp Gin Met Ala Val He Gin Tyr Ser 
20 2 5 30 

35 Trp 



<210> 51 
40 <211> 12 
<212> PRT 

<213> Homo sapiens 



<400> 51 

Asp Phe Pro Glu Met Met Ala Glu He He Ser Val 
1 5 10 
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